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TRANSCRIPTIONAL REGULATION IN PLANTS 

Field of the invention 

The present invention relates to meiosis in plants and 
more particularly the identification of regulatory 
sequences for meiosis-specif ic transcription of genes of 
interest and uses thereof . 

Background of the invention 

Meiosis provides a mechanism by which a heterozygous 
individual can create large numbers of genotypically unique 
recombinant gametes. Chromosomes replicate during 
interphase, as in mitosis, and enter meiosis with two 
chromatids. During meiotic prophase I, chromosomes condense 
from the dispersed state typical of interphase, to form 
long thin threads in leptotene, and each acquires a 
proteinaceous axial core to which the two sister chromatids 
are attached. During zygotene, homologous chromosomes 
become aligned, forming the synaptonemal complex and, at 
pachytene, non- sister chromatids of the completely paired 
chromosomes recombine forming the chiasmata which become 
visible during diplotene. Two cell divisions follow - 
reductional and equational - resulting in four gametes, 
with each single chromosome as a potentially recombinant 
chromatid. 

In yeast, molecular genetic analysis has revealed and 
led to the isolation of several genes that are essential 
for meiosis (Mitchell, 1995) . For example, the DMC1 (Bishop 
et al., 1992) and RAD51 (Shinohara et al . , 1992) genes are 
homologues of the E. coli recA gene, and appear to play a 
role not only in recombination-mediated homology- dependent 
pairing, but also in the strand exchange that results in 
chiasmata. Other genes, such as ZIP1, are required for 
synaptonemal complex formation (Sym et al . , 1993 ) . 

In higher eukaryotes, molecular analysis of the 
mechanisms controlling chromosomal pairing has been 
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significantly more difficult than in yeast, since the 
systems are more complex and less easy to manipulate. 
However, several meiotic mutations have been identified in 
Drosophila (for review see Carpenter, 1993; 1994) and 
higher plants (Sears, 1976; Golubovskaya , 1989; Curtis and 
Doile, 1992; Golubovskaya et al . , 1992, 1993; Tascheto and 
Pagliarini, 1993) and studied for their effect on meiosis. 
One such plant gene, Phi, suppresses pairing of 
nonhomologous chromosomes in wheat and a Phi -mutant 
background was used for the transfer and introgression of 
alien chromosome segments into wheat (Riley et al . , 1968). 

Plants have also provided an excellent cytological 
system for the study of meiosis (Gillies, 1984; Albini and 
Jones, 1987, 1988; Sherman and Stack, 1992; Albini, 1994; 
Schwarzacher and Heslop-Harrison, 1995) , but there has been 
little investigation at the molecular level. 

Lily anthers have offered the best system to study 
biochemical events that are correlated with the different 
stages of meiosis, because these stages are protracted and 
synchronised in adjacent flower buds, permitting the 
isolation of temporally regulated cDNA clones (Kobayashi et 
al . , 1994). One such gene, LIM15, was expressed 
specifically in prophase I of meiotic cells and is 
extremely homologous to the yeast DMC1 gene (Kobayashi et 
al.,. 1993) . 

In both yeast and lily, DMC1 and RAD51 proteins 
colocalize during zygotene (Bishop, 1994; Terasawa et al . , 
1995) . Early meiosis cDNA clones were also identified in 
wheat and maize by hybridisation to a Lilium meiosis- 
specific cDNA clone (Ji and Langridge, 1994) . 

The first genomic sequence of a recA-like plant gene, 
ArLiIMlB , with high degree of homology to that of LIM15, was 
recently described in Arabidopsis thaliana (Sato et al . , 
19 95) . However, no data showing the expression pattern of 
ArLIMIS, as well as no characterization of meiosis-specif ic 
promoter have so far been reported by others. Sequence of 
upstream region of ArLIMIS gene, shown in Figure 4 of 
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Klimyuk and Jones 1996 (see below) contains predominantly 
the sequence of transposon- like element, Limpet 1, (1874 bp) 
and only 260 bp of the promoter region, which is not 
sufficient to confer meiosis-specif ic expression of 
reporter gene. 

For many applications it will be useful to drive 
transcription of different genes in specific parts of the 
plant at specific developmental stages. The promoters of 
plant meiotic genes are of extreme interest as they may 
provide transcriptional regulation of their genes during 
this very restricted developmental period. The isolation 
and characterization of these promoters enables the study 
and modification of fundamental processes taking place 
during early sporogenesis, as well as to study the impact 
of such modifications on more advanced stages of sexual 
reproduction in plants. Work by the present inventors 
outlining their objectives has been shown as a poster 
display (Klimyuk, V.I. et al . "The isolation and 
characterisation of the meiosis-specif ic Arabidopsis 
thaliana DMC1 gene" . Abstracts of the 6th International 
Conference on Arabidopsis Research. June 7-11, 1995, 
Madison, Wisconsin, USA and Klimyuk, V.I. and Jones, J.D.G. 
"Identification of a transposon- like element, Limpet 1, in 
Arabidopsis thaliana" . Abstracts of the 7th International 
Conference on Arabidopsis Research. June 23-27, 1996, 
Norwich, UK) and their work has been presented in a paper 
published after the priority date of the present 
application (Klimyuk, V.I. and Jones, J.D.G. "AtDMCl, the 
Arabidopsis homologue of the yeast DMC1 gene : 
characterization, transposon- induced allelic variation and 
meiosis-associated expression 1 '. Plant J ( 1997 ) , 11 ( 1 ) , 1-14) 
which is incorporated herein by reference. 

Summary of the invention 

The primary aim of the inventors was to identify and 
isolate a plant promoter which confers meiosis-specif ic 
transcriptional regulation in plants. 
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They used the homologies between the lily LIM i 5 and 
the yeast DMC1 genes to design degenerate PGR (polymerase 
cham reaction) primers that amplified the Arabidopsis 
meiosis-specific DMC1 gene (designated AtDMCl) . The AtDMCl 
gene was completely sequenced and the transcript was 
characterised by RT(reverse transcriptase) -PCR. m situ 
hybridisation analysis showed that AtDMCl expression is 
restricted to pollen mother cells in anthers and megaspore 
mother cells in ovules. A translat ional fusion was made 
between the AtDMCl promoter and the GUS reporter gene 
Transgenic Arabidopsis carrying the AtDMCl promoter : GUS 
reporter gene fusion initiated GUS expression at the time 
of meiosis in both male and female lineages. 

Sequence comparison of AtDMCl and ArLIMIS (Sato et 
15 al., 1995) revealed that these genes, isolated from 

Landsberg erecta and Columbia ecotypes respectively, encode 
the same protein but are different in their promoter 
regions. We determined that the difference was caused by 
the insertion of a 1874 bp transposon-like element, 
designated Limpetl, into the promoter of ArLIMIS. This 
finding revealed that previously published putative 
promoter region of ArLIMIS (Sato et al . , 1995) 
predominantly contains Limpetl sequences and would not be 
expected to be sufficient to confer meiosis-specific 
expression of the GUS reporter gene. The sequence of 
functional AtDMCl promoter as well as the alignment of the 
promoter regions of the AtDMCl and ArLIMIS genes upstream 
of their transcription start sites are shown in Figure 4A 
and B, respectively. 

Reference herein to the AtDMCl promoter being used in 
various ways, in accordance with the present invention 
should be taken to be reference to a promoter including all 
or part of the promoter sequence shown in Figure 4 (A) , or 
a variant or derivative thereof, but excluding the promoter 
35 of AtLIMIS (Satoetal., 1995). Apart (fragment), variant 

or derivative of the promoter sequence shown should be 
sufficient to confer meiosis-specific expression on a 
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heterologous sequence operatively linked, i.e. under the 
control of, the part, variant or derivative of the sequence 
shown. One or more fragments of the sequence may be 
included in a promoter according to the present invention, 
5 for instance one or more motifs may be coupled to a 

"minimal" promoter. Such motifs may confer meiosis- 
specific promoter function on a promoter which otherwise 
drives expression in a non-meiosis specific fashion. 

According to a first aspect, the present invention 
TO provides a nucleic acid isolate comprising a promoter as 

indicated . 

In a second aspect, the present invention provides a 
nucleic acid isolate comprising a promoter, the promoter 
comprising a sequence of nucleotides shown in Figure 4 (A) 

15 and conferring meiosis specific expression on a sequence 

operably linked to the promoter. Restriction enzyme or 
nucleases may be used to digest the full-length nucleic 
acid shown, followed by an appropriate assay to determine 
the minimal sequence required for developmental 

20 specificity. A preferred embodiment of the present 

invention provides a nucleic acid isolate with the minimal 
nucleotide sequence shown in Figure 4 (A) required for such 
specificity . 

The promoter may comprise one or more sequence motifs 
25 or elements conferring developmental and/or tissue-specific 

regulatory control of expression. Other regulatory 

sequences may be included, for instance as identified by a 
mutation or digestion assay in an appropriate expression 
system or by sequence comparison with available 
30 information, e.g. using a computer to search on-line 

databases. 

By "promoter" is meant a sequence of nucleotides from 
which transcription of DNA operably linked downstream (i.e. 
in the 3' direction on the sense strand of double- stranded 
35 DNA) may be initiated. 

"Operably linked" means joined as part of the same 
nucleic acid molecule, suitably positioned and oriented for 
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transcription to be initiated 

operably linked tQ a ^ °» the promoter. DNA 

initiation regulation" of th P « transcriptional 
„, or the Promoter. 

The present invention extend, t- 

a nucleotide sequence which is ^AeTe " Pr ° m ° ter ^ 
derivative, by wav of , a "ele, mutant, variant or 

substitution or J le y tio ° n f of nU a ° tide ^tion, 

^rein. Systematic IT r^JLT^^ " 

to make an alteration genesis of nucleic acid 

Performed using any technTaul t nUCle ° tlde «^nce may be 

a any cectinigue known to thos P «,wn ^ • 
««• On. or more alterations . those skl11 ^ » the 



according to the • 3 P rom °ter sequence 

propter act iv - ty o rrnc: nVenti ° n "** '""^ °* 

.«.= t 0£ a y - s :: s as ; bl °e r t ? r 7* the °< 

15 activity. tQ ™° dulate the promoter 

"Promoter activity is used Co 
mrtrate transcription. The level of abUrty to 

quantifiable for instance h Promoter activity is 

mRNA produced by tran 7 "—"" nt ° £ «=•» -°™t of 
« assessment of £ ™ ^pro^T ^ " « «* 

session syste" £ ^ "V" 

specific a mp li ficaCion « r/ - used in a 

chain reaction. Use rt<r 33 the Polymerase 

determination of promotp * rep ° rter ^ne facilitates 

Production. ? " by -^rence to protein 

30 Therefore therp -i o 

comprising Provide * nuclei = «=id molecule 

9ene promot'lVor " 1 « i "-P"«ic «C1 (AtDHCa, 

3^ -t from al b i h ; r Tir;; p e e cr; :r° ter homolo9ous to <■> 

( c ) a meiosis-sDPri f-;^ ~ 
homologous DMC1 protein fro ° f the ^ne of a 

protein from another plant species; or 
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(d) a meiosis-specif ic promoter variant, mutant, 
allele or derivative of (a) , (b) or (c) ; or 

(e) a portion of (a) , (b) , (c) or (d) sufficient 
to confer meiosis-specif ic character to a promoter 

5 containing it. 

In various embodiments of the present invention a 
promoter which has a sequence that is a fragment, mutant, 
allele, derivative or variant, by way of addition, 
insertion, deletion or substitution of one or more 

10 nucleotides, of the sequence of the promoter shown in 

Figure 4 (A) , has homology with the shown sequence which is 
at least about 5% greater than the homology that the 
ArLIMIS promoter sequence (Sato et al . ) has with the 
sequence shown herein, preferably at least about 10% 

15 greater homology, more preferably at least about 20% 

homology, more preferably at least about 25% greater 
homology. The sequence in accordance with an embodiment of 
the invention may hybridise with the shown sequence but not 
the ArLIMIS promoter sequence under appropriately stringent 

20 selective hybridisation conditions. A promoter according 

to the invention may include one or more motifs that appear 
in Figure 4 (A) and are able to confer meiosis-specif icity 
on a promoter which contains them, and which are not 
present in the ArLIM15 promoter of Sato et al . 

25 The present invention also includes meiosis-specif ic 

promoters that are homologous to the AtDMCl gene promoter. 
Further, the present invention includes meiosis-specif ic 
promoters of the gene of a homologous DMC1 protein from 
another plant species. An homologous promoter or nucleic 

30 acid encoding an homologous DMC1 protein may show greater 

than 55% homology with the sequence of Fig. 4A or Fig. 5A 
or Fig. 5B, greater than 65% homology, greater than 75% 
homology, greater than 85% homology or greater than 95% 
homology. Such homology may be shown for a sequence of at 

35 least 20 nucleotide bases, at least 50 nucleotide bases, at 

least 100 nucleotide bases, at least 300 nucleotide bases 
or at least 500 nucleotide bases. 
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able to promote traMcrlption '™ «- U3S e d 
specific manner, operably linked to k 3 meiosis " 

acid se q uence, preferably a gene * ^ 
By "heterologous" is meanf ! ' 9 ' a Codln 9 sequence. 

ccdi„ g seguence . Wi£ : :,n; «*. 

may be excluded. Genera] i v 1119 sec 3 ue nce 

which may T:^ m zrzz h i transcribed 

Polypeptide product which may be detected a , " 

quantitated following expression. T gene 

Product may be assayed following expression 

"reporter gene- ± e „ expression is termed a 

y ' 1 - e - a 9ene which " record » ™ 
activity. reports on promoter 

The present invention 
vector comprising a proper ' he """^ 
vector may comprise a suitable aiSClosed herein. Such a 
or other means f or ^J^Tl^^™^™ ^ 

S e q uence S ;r;r e ; f "*r-- p-™*^ 

seances as appropriate"; ~ 

example, MoJecular cloning a r h 3 See ' * or 

rr J;—;- - - 

Tnus, a further aspect of , k . 
provides a host cell , „ 1 Present invention 

containing a nucleic ac ld ^ " 
element. as disl osed he^L"^: "T^" 9 * 
heterologous nucleic acid linked C ° - 

(urther aspect provides a TtTT* " ^ * 
such a construe/ into a t^ 0 — 
e^Ploy any available technique I n ^T^ 1 ^ 



WO 98/28431 



PCT/GB97/03546 



skilled in the art. 

The introduction may be followed by causing or 
allowing expression of the heterologous nucleic acid 
sequence or gene under the control of the promoter. 
5 In one embodiment, the construct comprising promoter 

and nucleic acid sequence or gene integrated into the 
genome (e.g. chromosome) of the host cell. Integration may 
be promoted by including in the construct sequences which 
promote recombination with the genome, in accordance with 

10 standard techniques. 

Many known techniques and protocols for manipulation 
of nucleic acid, for example in preparation of nucleic acid 
constructs, mutagenesis, sequencing, introduction of DNA 
into cells and gene expression, and analysis of proteins, 

15 are described in detail in Short Protocols in Molecular 

Biology, Second Edition, Ausubel et al . eds. , John Wiley & 
Sons, 1992, the disclosure of which is incorporated herein 
by reference. 

Nucleic acid molecules, constructs and vectors 

20 according to the present invention may be provided isolated 

and/or purified (i.e. from their natural environment), in 
substantially pure or homogeneous form, free or 
substantially free- of a coding sequence, or free or 
substantially free of nucleic acid or genes of the species 

2 5 of interest or origin other than the promoter sequence. 

Nucleic acid according to the present invention may be 
wholly or partially synthetic. The term "isolate" 

encompasses all these possibilities. 

An aspect of the present invention is the use of 

30 nucleic acid according to the invention in the production 

of a transgenic plant. 

When introducing a chosen gene construct into a cell, 
certain considerations, well known to those skilled in the 
art must be taken into account . The nucleic acid to be 

35 inserted should be assembled within a construct which 

contains effective regulatory elements which will drive 
transcription. There must be available a method of 
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transporting the construct into the cell. Once the 
construct is within the cell membrane, integration into the 
endogenous chromosomal material either will or will not 
occur. Finally, as far as plants are concerned the target 
cell type must be such that cells can be regenerated into 
whole plants. 

Plants transformed with the DNA segment containing the 
sequence may be produced by standard techniques which are 
already known for the genetic manipulation of plants. DNA 
can be transformed into plant cells using any suitable 
technology, such as a disarmed Ti-plasmid vector carried by 
Agrobacterium exploiting its natural gene transfer ability 
(EP-A-270355, EP-A-0116718 , NAR 12(22) 8711 - 87215 1984), 
particle or microprojectile bombardment (US 5100792, EP-A- 
44-1882, EP-A-434616) microinjection (WO 92/09696, WO 
94/00583, EP 331083, EP 175966, Green et al . (1987) Plant 
Tissue and Cell Culture, Academic Press), electroporation 
(EP 290395, WO 8706614) other forms of direct DNA uptake 
(DE 4005152, WO 9012096, US 4684611), liposome mediated DNA 
uptake (e.g. Freeman et al . Plant Cell Physiol. 29: 1353 
(1984)), or the vortexing method (e.g. Kindle, PNAS U.S.A. 
87: 1228 (1990d) Physical methods for the transformation of 
plant cells are reviewed in Oard, 1991, Biotech. Adv. 9-. l- 



11 



Agrobacterium transformation is widely used by those 
skilled in the art to transform dicotyledonous species. 
Recently, there has been substantial progress towards the 
routine production of stable, fertile transgenic plants in 
almost all economically relevant monocot plants (Toriyama, 
et al. (1988) Bio/Technology 6, 1072-1074; Zhang, et al . 
(1988) Plant Cell Rep. 7, 379-384; Zhang, et al . (1988) 
Theor Appl Genet 76, 835-840; Shimamoto, et al. (1989) 
mature 338, 274-276; Datta, et al . (1990) Bio/Technology 8, 
736-740; Christou, et al . (1991) Bio/Technology 9, 957-962; 
Peng, et al . (1991) International Rice Research Institute, 
Manila, Philippines 563-574; Cao, et al . (1992) Plant Cell 
Rep. 11, 585-591; Li, et al . (1993) Plant Cell Rep. 12, 
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™'7e* 7 h ° re ' " ' 1993> oology 21 

371-884; Fromm, eC al . U990, Sio/Wl]oJ 83^.39' 

Gordon-Kamm. et al . , Cell 2 " " ' 

D'Halluin. et al. U992, plane Csll «"; s18 ' 
et a] /1QQ01 o7 ^ 1505; Walters, 

(1992, Plant Molecular Biology is. 18 9-200; Koziel 
« -i- U993, Biot.cHn.logy n. a94 _ 200; ^ K °^' 

^y^olojy 102 , 1077-1084; S«rs . et al 
U992, Bio/recHnology 1„. 1589 _ 1594; M092/14828) ^ 

ei"i:r r a^ robacterium mediated is now 

emerg lng also as a n highly efficient alternative 
transformation method in monooots (Hiei et al. ( 1994) rhe 
Plant Journal s, 271-282, . 

achie^H 9ene K rati0n ° f fe " ile transgenic plants has been 
achieved in the cereals rice, maize, wheat, oat, and barley 
(renewed in Shimamoto. K. ( i 994) Current . opinion in 
Biotechnology 5, 158-162.; Vasil, et al . (1992, 
Bio/Technology 10 , 667 . 674; Vain £t al 1 2) 
Biotechnology Mvmemml3 (4) , 6S3 . 671; Vasil _ ^ ^ 
Biotechnology 14 page 702) . 

Microprojectile bombardment, electroporation and 
direct DNA uptake are preferred where Agrobacterium is 
xneff.c.ent or ineffective. Alternatively, a combination 
of different techniques may be employed to enhance the 

wLhT^ ^ tranSf — tion Process, eg bombardment 

w.th Agrobacterium coated microparticles (EP.-A-486234) or 
microprojectile bombardment to induce wounding followed by 
co-cultxvation with Agrobacterium (EP-A-486233 ) 

Following transformation, a plant may be regenerated, 
eg from s.ngle cells, callus tissue or leaf discs, as is 
standard in the art. Almost any plant can be entirely 
regenerated from cells, tissues and organs of the plant 
Aval able techniques _ ±n vas±i ^ P 

35 XXX lV nd S ° matiC ^ GGnetiCS ° f P1 * ntS < ™ *' « 

XXX, Laboratory Procedures and Their A PP l ications , Academic 

Press, 1984, and Weissbach and Weissbach, Methods for Plant 
Molecular Biology, Academic Press, i 989 . 
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The particular choice of a transformation technology 
will be determined by its efficiency to transform certain 
plant species as well as the experience and preference of 
the person practising the invention with a particular 
methodology of choice. It will be apparent to the skilled 
person that the particular choice of a transformation 
system to introduce nucleic acid into plant cells is not 
essential to or a limitation of the invention, nor is the 
choice of technique for plant regeneration. 

Also according to the invention there is provided a 
plant cell having incorporated into its genome nucleic 
acid, particularly heterologous nucleic acid, as provided 
by the present invention. A further aspect of the present 
invention provides a method of making such a plant cell 
involving introduction of a vector including the sequence 
of nucleotides into a plant cell and causing or allowing 
recombination between the vector and the plant cell genome 
to introduce the sequence of nucleotides into the genome. 
The invention extends to plant cells containing nucleic 
acid according to the invention as a result of introduction 
of the nucleic acid into an ancestor cell. 

Plants which include a plant cell according to the 
invention are also provided, along with any part or 
propagule thereof, seed, selfed or hybrid progeny and 
25 descendants. A plant according to the present invention 

may be one which does not breed true in - one or more 
properties. Plant varieties may be excluded, particularly 
registrable plant varieties according to Plant Breeders' 
Rights. It is noted that a plant need not be considered a 
"plant variety simply because it contains stably within 
its genome a transgene, introduced into a cell of the plant 
or an ancestor thereof . 

In addition to a plant, the present invention provides 
any clone of such a plant, seed, selfed or hybrid progeny 
and descendants, and any part of any of these, such as 
cuttings or seeds. The invention provides any plant 
propagule, that is any part which may be used in 
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reproduction or propagation, sexual or asexual, including 
cuttings, seed and so on. Also encompassed by the 
invention is a plant which is a sexually or asexually 
propagated off -spring, clone or descendant of such a plant, 
5 or any part or propagule of said plant, off-spring, clone 

or descendant . 

The AtDMCl promoter may be used in meiosis-specif ic 
expression of a heterologous sequence, including a variety 
of cytotoxic genes, eg ribosome inactivating proteins (Lord 

10 et al . , 1991) or cytotoxic RNase barnase (Goldman et al . , 

1994), DNA modifying enzymes including rare-cutting site- 
specific endonucleases , eg Xbal , I-Scel, HO endonuclease 
(Brenneman et al . , 1996; Puchta et al . , 1996; Chiurazzi et 
al . , 1996; Haber, 1995), or recombinases , eg FLP 

15 recombinase (Kilby et al . , 1995), recombinase from 

Zygosacharomyces rouxii (Onouchi et al . , 1995), 
bacteriophage PI ere recombinase (Osborne et al . , 1995) as 
well as different transcription factors (Meshi & Iwabuchi, 
1995; Ramachandran et al . , 1994), protein kinases and 

20 phosphatases (Stone & Walker, 1995) , cell cycle regulators 

(Ferreira et al . , 1994; Dahl et al . , 1995), which are 
normally not expressed during the time of meiosis. This 
may serve different purposes, such as ablation of meiotic 
cells and isolation of apomictic plants, designing an 

25 efficient homologous recombination system for plants, 

increasing meiotic recombination frequency, for 
introgression of alien chromosome segments into host plant, 
or altering normal events of cell cycle during the time of 
meiosis and producing male and female sterile plants. 

30 Easily assayed reporter genes, eg GUSA ( Jef f erson, 1987) or 

GFP (Sheen et al . , 1995) under control of AtDMCl promoter 
may be used as markers for detection of early meiotic 
events in plants, especially for the analysis of different 
mutations affecting meiosis. 

35 The AtDMCl promoter may be used for improving 

transposon tagging efficiency. The main factors determining 
the efficiency of transposon-based system in the tagging of 
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host genes are the frequencies of transposon excision and 
"insertion, and the independence of transposition events 
The last factor is crucial for a system with high level of 
excision and reinsertion events, as such system often yield 
clonal transpositions. Using the AtDMCl promoter, driving 
the transpositions at the restricted stages of plant 
development, particularly at the early stages of meiosis 
improves the efficiency of transposon tagging by producing 
only unique (independent) transposition events. 

Therefore, the present invention provides a method of 
transposon tagging comprising the steps of creating 
construct comprising a promoter as described above operably 
linked to a transposase required for transposition of a 
transposable element; transforming a plant cell with said 
construct such that the transposase is driven by said 
promoter; and determining transposition events. 

The AtDMCl promoter may be used in searching for 
apomictic mutants or used by seed producers to produce 
seeds apomictically. Apomixis is the definition of asexual 
reproductive processes that occur in ovules of flowering 
plants (Koltunow, 1993). Mutants with fertilisation- 
independent seed development have been recently described 
for Arabidopsis (Ohad et al . , 1996; Chaudhury et al.,1997) 
Apomictically produced plants are genetically identical 
with the female parental plants. Apomictic reproduction may 
therefore be beneficial for agriculture, as. it may be an 
inexpensive way to preserve given genotype through 
successive generations. 

Ablation of meiotic cells or changing their fate 
through altering events of meiotic cell cycle can be 
extremely beneficial for apomictic seed production, as it 
will eliminate sexually produced seeds from the progeny. 
Thus, the only seeds that will survive will be apomictic. 

Schematic presentation of an exemplary construct 
suitable for searching for apomictic mutants as well as for 
the apomictic seeds production is shown in Figure 1A. The 
construct carries meiosis-specif ic promoter (msp) fused 
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through a DNA insertion sequence (DNA ins) to a 
modification nucleic acid sequence, preferably a gene (tx) 
encoding a cytotoxic protein. The DNA insertion sequence 
may be any DNA sequence (another gene, eg transformation 
marker or counter selectable marker) which does not confer 
transcription of cytotoxic gene and is sufficiently long to 
prevent transcription of cytotoxic gene from the meiosis- 
specific promoter. DNA insertion is preferably flanked with 
DNA sequences which are the target sequences (ts) for an 
activator, preferably a DNA recombinase (r-se) , under 
control of inducible promoter (ip) . The target sequences 
may be, for example, lox recombination sites with the same 
orientation in the case of the ere recombinase, or the 5' 
and 3' Ac ends for the Ac transposase. 

The transcription of DNA recombinase gene may be 
controlled by an inducible promoter (ip) , for example, 
promoter of heat shock inducible gene (Yoshida et al . , 
1995) . Optionally, any other inducible system may be used, 
for example, copper-controllable (Mett et al . 7 1993), or 
glucocorticoid-controllable (Aoyama et al . , 1995) gene 
expression system, but in this case the constructs design 
will be more complex, as such systems are transcription 
factor mediated. 

Induction of the recombinase or transposase gene 
expression will lead to the elimination of the DNA 
insertion due to the homologous recombination between two 
flanking target sequences in the case of a cre/loxP- like 
system (one target sequence will be left behind) or 
complete excision of the DNA insertion together with both 
target sequences in case of Ac/Ds- like system. This will 
lead to the meiosis-specif ic transcription of cytotoxic 
gene and, consequently, meiotic cells death. For some 
powerful cytotoxins (barnase) the low expression level of 
inhibitor protein (txi) (barstar) would be an advantage, as 
it may prevent negative effect of possible leakage in 
cytotoxic gene expression. 

This system may be useful, firstly for apomictic seed 
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when reproduction is not successf J ^Zl;^ 1 ' 
species (Koltunow, 1993) m thi, c"rus and grass 

Plant parts eg pollen carrvL a "**" " * lmt ' « 

Figure 1A T vT Carrying a construct as described in 
Figure 1A. may be mutagenised and the site- specific 

r^r 86 9enS SXP " SSi - — - induced in seu!" 

T d , ™ --agenised seeds, m this situation vLbTe 
seeds „m only be produced apomictically in an appropr.a tl 
mutant background. ppropriate 

15 , inCreasi "9 the efficiency of the system it may be 

an advantage to use easily screenable counter selectable 
marker, for example a bacterial cytochrome P450 (O'Keefe et 
al.. 1994) or phosphonate monoester hydrolase (pehA) 
(Dotson et al 199s) as the DNA insertion , DNfl ins, (s^e 

Into herb !T 9en£S ""^^ C ° nVerS i0n ° f P— herbicides 
into herbrcrdes and any seedlings (potentially false 

posrtrves, with inactive cytotoxic gene <tx, (because of 

the presence of DNA insertion, win be easily removed by 

the treatment appropriate pro-herbicide 

i„vo/ n thiSCaSe the USe ° f "e/lox -"ke system (i.e. one 
involv „g s.te-specific recombination, is preferable to 

lead t; 1 ^ SS -""v-tion of first system win 

to the loss not reintegration, of the DNA insertion. 
M The "fore, the present invention provides a method for 
30 L d f f UCln9 aP ° miCtically comprising the steps of 

Ze eic" 9 T Ce " S ^ lnc ~P««t^ i«o their genome a 
nucleic acid constmrh = , = 

u construct as described above so that the 

modification nucleic acid sequence is expressed in said 

me.otrc celis thereby e lim inating sexually produced seeos 

in plant regenerated therefrom. The nucleic acid 

35 construct may be incorporated into the eel! genome "y 

breeding techniques wel! known to the person skilled in the 

art or by standard transformation techniques 
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Also in accordance with the present invention there is 
provided a method of detecting apomictic mutants comprising 
the steps of generating a plant or plant part carrying a 
nucleic acid construct as described above; creating a 
5 variation for apomixis either in accordance with a breeding 

programme or in accordance with a mutagenesis programme; 
deriving a suitable self progeny population; inducing the 
activating gene in the self progeny as described above and 
detecting viable seeds produced apomictically . A suitable 

10 self progeny population will be understood by those skilled 

in the art to mean one where the phenotype of the mutation 
can be revealed e.g. where individuals are at least 
partially homozygous by selfing. Genetic variation could be 
created from breeding and selection of existing genetic 

15 stocks or by inducing such genetic variation, for example 

by mutagenesis. 

In Figure IB we present an exemplary construct 
permitting the removal of any unwanted DNA sequences from 
transgenic plants. It may be useful, for example, for 

20 eliminating genes conferring antibiotic resistance, which 

are required for selection of transformed plants, but are 
not necessary needed afterwards. Moreover, the presence of 
such genes can be damaging for the release of transgenic 
crops (or at least controversial, given the current 

25 political climate and food scares) . The use of Cre/lox 

recombination system to remove selected* genes from 
transgenic plants was already reported (Dale & Ow, 1991). 
The system described by Dale and Ow (1991) required the 
second round of transformation in order to introduce 

30 recombinase source under the control of constitutive 

promoter. In our construct meiosis specific promoter (msp) 
drives the transcription of site specific recombinase gene 
(r-se) , which, e.g. as it was described above, eliminates 
DNA of interest (DNA ins) through the mechanism of 

35 homologous recombination during the first cycle of sexual 

reproduction in transgenic plant unlike the cre/lox- system. 
It will not require any additional rounds of transformation 
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as wall as the use of inducible promoters to drive 
recomirnase gene. The recombination event can be £Z 

DNA insertion between constitutive propter Icp, IZ 
reporter gene <rg, or by using PCR screening 

Therefore, an aspect of the present invention is to 
prevent seed derived fro* sexual reproduction so that only 
apcnrctrc seed are set (in plants where apomixis can 
occur, . This would be of interest for species where making 
hybr ld seed is uneconomic on a large scale. The hybrid 
seed may be made on a small scale and then be multiplied 
apomictically to produce large scale quantities of hybrid 
seed. Hybrid seed is desirable because hybrids generally 
perform better then inbreds . 

Another aspect of the present invention is to identify 
Plants with a tendency to form apomictic seed. This can be 
achxeved in connection with a mutagenesis or breeding 
programme (see Fig. 1A) . Furtner , the present invention 
also relates to a method for removing pieces of nucleic 
acxd from transgenic plants (e.g. sequences just needed to 
transform/select transgenic cells/plants) . 

Aspects and embodiments of the present invention will 
now be illustrated, by way of example only, with reference 
to the accompanying figures. Further aspects and 
embodiments will be apparent to those skilled in the art 
All documents mentioned herein are incorporated by 
reference . 

Brief deflcription o f thc , 
In the Figures: 

Figure 1 shows schematic representation of uses of 
mexosis-specific promoter in searching for apomictic 
mutants, maintaining the apomictic mechanism of seeds 
production and in removing of unwanted DNA sequences from 
transgenic plants. Abbreviations: tm - transformation 
marker; wp - weak promoter; tx- gene conferring toxic 
Phenotype; txi - inhibitor of tx ; msp - meiosis-specif ic 
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promoter; ts - target sequence for r-se gene product; DNA 
ins - any DNA sequence sufficiently large to block the 
meiosis-specif ic transcription of tx gene; cp 
constitutive promoter; rg - reporter gene; ip - inducible 
5 promoter; r-se - gene conferring DNA recombination. 

Encircled parts of the constructs are optional. 

Figure 2 shows the restriction map (A) of the AtDMCl 
gene and T-DNA region of binary vector carrying pAtDMCl:GUS 
fusion. The positions of degenerate primers used to produce 

10 AtDMCl-specif ic probe A are shown by arrows. Probe B, which 

was used for RFLP mapping, corresponds to the 5.9 kb EcoRV 
fragment of genomic clone. The positions and sizes of 15 
exons are shown by open boxes; the ATG and the TGA 
translation initiation and translation termination codons 

15 are also shown. The promoter region used for pAtDMCl :GUS 

translational fusion is shown as solid line. Figure 2(B) 
shows the schematic representation of the plasmid SIjJ7753 
T-DNA region introduced into A.thaliana in order to assess 
the specificity of the AtDMCl promoter. The construct 

2 0 carries a pAtDMCl:GUS translational fusion and the NPT gene 

as a transformation marker. 

Figure 3 shows nucleotide sequence of the AtDMCl 
promoter and adjacent region. The nucleotide sequence from 
-450 to +330 is shown. The transcription start site is 

25 designated +1. The putative TATA box sequence is boxed. The 

alternative putative TATA box is overlined. Two nearly 
complete and two complete direct repeats are underlined. 
Consensus sequences of splicing are indicated by double 
underlining. The ATG codon in the second exon (shown in 

30 bold) was mutagenised in order to introduce an Ncol site 

for translational fusion with the GUS gene. 

Figure 4 shows (A) DNA sequence of the AtDMCl promoter 
including part of the transcribed region. Transcription 
start site located at the position 4691- (B) Alignment of 

3 5 the promoter regions of the AtDMCl (top strand) and ArLIM15 

(bottom strand) genes upstream of their transcription start 
sites. Only identical sequences in the alignment are shown. 
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Figure 5 shows DNA sequences of plant DMC1 homologues. 
Figure 5(A) shows partial genomic DNA sequence of barley 
DMC1 homologue (HvDMCl). Putative promoter region is 
located upstream of the translation start site (position 
1599) . Genomic DNA region amplified by the primers MEI1 and 
MEI4 is located between the positions 4001 - 4707. 
Figure 5(B) shows partial genomic DNA sequence of tomato 
DMC1 homologue (LeDMCl) . Genomic DNA region amplified by 
the primers MEI1 and MEI4 is located between the positions 
466 - 1240. 

Figure 6 shows a schematic presentation of the T-DNA 
regions of constructs SLJ11332 (A) and SLJ112315 (B) . 

Figure 7 shows alignment of the amino acid sequences 
of the AtDMCl , LIM15 and DMC1 proteins. Identical and 
conserved amino acid residues are boxed in black and grey, 
respectively. Gaps in the alignment are shown by dots. The 
positions and orientations of primers are shown by arrows. 
The vertical open arrowheads indicate the positions of 
introns in the AtDMCl gene. The A and B motifs of the 
consensus sequence for the purine nucleotide binding site, 
detected by visual inspection, are underlined. 

Figure 8 shows .the restriction map of the promoter 
region of the ArLIMIS gene. The positions of the first five 
exons are shown by open boxes. The insertion of transposon- 
like element, Limpetl, flanked by two 9 bp direct repeats 
is also shown. 



Detailed description a nd exemplification of the invention 
Materials and Methods 
Plant material 

The plants used in this study were Arabidopsis 
thaliana Columbia and Landsberg erecta ecotypes . Plants 
were grown in the greenhouse at 25oC under 16 hrs of 
illumination and automatic watering conditions. 

Plant transforms tion 

Plant transformation of Arabidopsis thaliana (ecotype 
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Columbia) was performed as described (Bechtold et al . , 
1993). Seeds were harvested three weeks after the vacuum- 
infiltration, sterilised and screened for transf ormants on 
GM + 1% glucose medium (Valvekens et al . , 1988) containing 
5 5 0 mg 1-1 kanamycin. 

DNA isolation and DNA gel blot analysis 

Genomic and cosmid DNA isolation from the plant tissue 
and E. coli respectively was performed as described 

10 (Klimyuk et al . , 1995; Sambrook et al . , 1989). The DNA was 

digested with restriction enzymes and DNA fragments were 
separated on 1% agarose gel, transferred to Hybond-N 
membranes (Sambrook et al . , 1989), immobilised on the 
membranes by UV crosslinking (UV Stratagene stratalinker 

15 2400) and subsequently baked on the membranes for 1 hour at 

8 0oC. The hybridisation procedure was performed as 
described by Church and Gilbert (1984) . DNA fragments for 
use as probes were gel -purified and were labelled using 
commercially available oligolabelling kit (Pharmacia) . 

20 

Subcloning and DNA Sequencing 

All subclonings and template preparations were done 

using the phagemid BlueScript (KS+) vector (Stratagene) . 

The series of unidirectional 250-300 bp deletions were 
25 carried out for large inserts using the Erase-a-Base system 

(Promega) . The sequencing reactions were performed by using 

DyeDeoxy Terminator cycle sequencing kit (Applied 

Biosystems) . In some cases the PCR products were sequenced 

directly. Sequence analysis was carried out on ABI 3 73A DNA 
30 sequencer (Applied Biosystems) . DNA contig, carrying AtDMCl 

gene sequence, was built up by using Autoassembler 

programme (Applied Biosystems) . 

Construction of pAtDMCl :GUS fusion 
35 The Ncol site was introduced by site-directed 

mutagenesis (Kunkel, 1985) at the position of the first ATG 
codon (shown in bold in Figure 3) in the second exon of 5.9 
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kb EcoRV subclone of the AtDMCl gene to M v , 
SLJ7731. The sequence of the primer d. ^ 
for mutagenesis is- 5 '- TAG 7rr ™ dSS19nated MUT * used 
CCC CAT CCA OCT CGT TGA ■ ^ ^ ™ 

5 - -Id. The final construct 1 

and Pstl r-^ct-v-i ^ ,• yt;facea with Ncol 

rsti restriction enzymes, pel nunfi^ = , , . 

s LJ77 «, ca « yin9 PAtDMC1:GUS 3 , ocs f jjz\i s( zt 

vector, „ as digested with „ paI and SmaI ^.^f'***' 
10 enzymes. The large fragment, released by this 

s r if r r subcioned int ° Hpai - - : 

fc>tiJ4 9i, based on pRK290 Tin^ -f * 

«» r », wa S ^^ed iir 4r a .rr- e :::ii:: 

experiments. transformation 



Histoche/nicaJ assay 

20 X G l u c T r t n a Sf0 r d PlantS Carryi " 9 T " DNA ° f S "" 53 ««e 

XGluc staged at the different stages of development as 

P~v.ou.ly described (Klimyu* et a! . . 1995) . Sq u a shes Qf ^ 
Glue stoned, pigment -washed flower buds were prepared as 
descried below. Specimens were placed in eppendorf luZ 
and vacuum-infi ltra ted with immersion oil in SpeedvL 
2S concentrator svcaooH , Savant, . Then buds with remains If 

immersion oi! were placed on slide and gently -slashed with 
cover sl lp . The squashes wer£ examined ch 

Axiophot microscope. SS 

30 fl hiSt ° Chemical Realisation of GUS activity in 

f ower buds was performed on 10 mm cross-sections Tt 
Historesin-embedded plant material as it previously 
described (Dolan et al . , 1994) Pftr . previously 
t_ h(a .. ' xy94 >- For better visualisation of 

Gained T , "™ ^ °' "ion. were 

earned for ! mm with 0.01% Safranin. The sections were 
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examined with a Nikon Microphot SA mic »«« 
a dart field condenser. ° Ph0t " SA =°P* equipped with 

The following criteria were used to identify the 
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stages of flower development (Bowman, 1994) : stage 9 - 
petal primordia stalked at base, pistil length (from the 
top of the stigma to the point of attachment to the 
receptacles) is 0.15 - 0.4 mm; stage 10 - stamen filaments 
5 begin to elongate, pistil length is 0.4 - 0.5 mm; stage 11 

- stigmatic papillae appear, pistil length is 0.5 - 1.5 mm. 

RT-PCR and identification of cDNA ends (RACE) 

RT-PCR analysis of AtDMCl expression was performed 

10 with MEI1U (5'-GGA GGG AAT GGA AAA GTG-3') primers using 

1/ig of total RNA from 12 day-old seedlings, leaves and 
floral buds as a template. The positions and orientations 
of the primers coincide with those of their degenerate 
homologues, MEI1 and MEI4 (Figure 2A) . As an internal 

15 control, primers for Antirrhinum majus polyubiquit in mRNA 

(GenBank accession number X67957) , kindly provided by Dr M. 
O'Dell, were used: 1392 (5'-CAG ATC TTT GTG AAG ACT CTG-3') 
And 1393 (5' -GGA CTC CTT CTG GAT GTT GTA - 3 ' ) . Primer 1393, 
directed in antisense orientation was used for cDNA strand 

20 synthesis. 

RNA in situ hybridization 

Digoxigenin- labeling of RNA probes, tissue preparation and 
in situ hybridization were performed as described by 
25 Bradley et al . (1993) and Coen et al . (1990). 

X-ray treatment of Arabidopsis plants 

Twelve-day-old Arabidopsis plants grown in small (5cm in 
diameter) plastic petri dishes were exposed to 5 and 10 
30 krad of ionizing irradiation. The ABB 6 MV linear 

accelerator served as a source of radiation. One hour 
after irradiation plants were used for X-Gluc staining and 
RNA isolation. Non- irradiated plants served as a control. 

35 RNA gel blot analysis 

RNA samples were separated on 1.4% agarose- formaldehyde 
gels as described by Ausebel et al . (1987). Agarose gel 
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was rinsed in several changes of sterile DEP-treated 
distilled water in order to remove formaldehyde and blotted 
overnight in lOxSSC to Hybond-N membrane. Membrane was 
carefully rinsed with deionized sterile water and RNA was 
immobilized to membrane and hybridized with probe as 
described above for DNA gel blot analysis. 

EXAMPLE 1 

Isolation of the AtDMCl gene 

In order to isolate the Arabidopsis thaliana DMCl 
homologue, five different degenerate primers were designed 
corresponding to amino acid motifs conserved in LIM15 and 
DMCl proteins: MEI1- 5' GG (N) AA(GA) GT(N) GC (N) TA(CT) 
AT (ACT) GA 3'; MEI2 - 5' GA(CT) AC (N) GA(GA) GG (N) AC (N) 
TT(CT) (.CA)G(N) CC3 ' ; MEI3 - 5' A(GA)(TC) TT (TC) TG(TC) 
TG(N) C(GT)(TC) TC 3'; MEI4 - 5' AC (N) GC (N) AC (GA) TT (GA) 
AA(CT) TC(CT) TC(N) GC 3'; MEI5 - 5' GC (GA) TG (N) GC (N) 
A ( GA ) N AC (GA) TG (N) CC (N) CC 3 ' . The positions of primers 
and their orientations are shown in Figure 7. Different 
combinations of primers were used for PCRs with Arabidopsis 
genomic DNA as a template. The PCR reactions were performed 
with 0.05 fig of A. thaliana (Landsberg erecta) DNA as a 
template in a volume of 50 /zl in the presence of 2 fiM of 
each of the two selected primers in a buffer containing 250 
MM dNTPs (Pharmacia) 10 mM Tris-HCl, pH8 . 3 , 50 mM KCl , 2.5 
mM MgCl2, 0.05% Nonidet P-40, and 2.5 units .of "AmpliTaq" 
thermostable DNA polymerase (Perkin Elmer Cetus) . Cycling 
conditions were: 94 oC for 15 sec; 50 oC for 30 sec; 72 oC 
for 2 min; 35 cycles, followed by a 10 min extension at 72 
oC. The largest PCR products were reamplified with nested 
primers. Only two sets of primers, MEI1 - MEI5 and MEI1 - 
MEI4, proved effective. Primers MEI2 and MEI3 did not 
amplify the expected size class either together or in 
combination with other primers. Subsequent analysis showed 
that they anneal to parts of the cDNA sequence that are 
interrupted by introns in genomic DNA (Figure 7) . The major 
679 bp PCR band, obtained after the reamplif icat ion with 
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MEI1 and MEI4 primers of the first round PCR product (MEI1 
and MEI5 primers) , was blunt-ended by treatment with T4 
polymerase, subcloned into the EcoRV site of pBS(KS+) and 
sequenced. The fragment, designated as probe A (Figure 2A) , 
encoded 111 amino acid sequence with 93% of identity to the 
homologous part of LIM15 15 protein (Kobayashi et al . , 
1993) . 

Two unique primers, -MEI1U (5' GGA GGG AAT GGA AAA GTG 
3') and MEI4U (5' GCA ACG TTG AAC TCC TCT GCA AT 3') , that 
annealed to the same location as their degenerate 
homologues (Figure 2A) , were synthesised and used for the 
PCR screening of DNA pools prepared from 68 plates (384 
clones per plate) of an four genomic equivalents 
Arabidopsis cosmid library. The library was made on the 
basis of CLD04541 binary vector and was kindly provided by 
C. Lister and C. Dean (Cambridge Laboratory, JIC, Norwich) . 
The probing of filter replicas from positive plates with 
probe A recovered five cosmid clones. One of the clones, 
called 64/23/C, was used for the restriction mapping and 
subcloning of 5.9 kb EcoRV and 3.6 kb Clal overlapping 
fragments into the pBS ( KS+ ) vector. Both strands of two 
overlapping fragments , encompassing 8 kb of cosmid insert, 
were completely sequenced. 

The DNA sequencing and sequence analysis was performed 
as described above. Database searches with the BlastX 
program (Altschul et al . , 1990) revealed a gene within this 
region, designated AtDMCl, whose highest homology was to 
the ArLIM15 protein followed by LIM15 and its human and 
mouse homologues as well as several RAD51 homologues and 
other RecA-like proteins, including yeast DMC1 , from 
different eukaryotic organisms. The GenBank accession 
number for the AtDMCl gene is U76670. DNA gel blot 
analysis of AtDMCl revealed that this is a single copy 
gene . 

Molecular mapping 

Labelled 5 . 9 kb EcoRV fragment of AtDMCl genomic clone 
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(designated as probe B . p 

probe Southern blot of Z.h-f lnitia "y used to 

Landsberg erecta and ^^^1' eta "»- <~otypes 
KPLPs . AtDMCl was shown * £ ™* C ° -P-cific 
BcoHV. Xba!. „ indIII , HpaI an b ; B ^™^ »ith EcoKl, 

thallana Landsberg erecCa " EI en ^= between A. 

subse quently , Southern jo! ^i, acotypes 

isolated fro m 41 recombinant inbred lR1 Ti ■ DNAS 

eootypes Landsberg erecta and r , w betue ™ the 

» 9 3, were hybridised " th «" 

of AtDMCl gene was carried out bv "I 01 "" 1 " 

MAPMAKER v.l.o (Lander at al 198 ^ 9 *** 

segregation of 92 Sinai- " and data for the 

Arabidopsis chromosomes ^ "» «ve 

The..,RELP for the AtDMn , 

chromosome 3 between the^Tnd™ ^"V* *» ^ 

" ChS Stances of 4 . a c M and 7 7 " «rk.r. 

markers respectively. I n additi " fr ° m each « the 

recombinant inbred lines <* AtDMCl using 

-otypes led to the identific t 9 *"* C ° lurabla 
- conclusion that ^'Z^'^^ ~ 

Characterisation of the A tDMC 7 ,- 

Screenina of transcript by RT . PCR 

reenmg of an Arabidopsis <„„ 
library made usi za ptt "florescence cDNA 

Dangl) „ ith prob£ 9 A ^ ^ ve «°- (distributed by j. 

probably due to the low ^uLZTTj^I 

the AtDMCl transcripts c„ tranS "°^ Presence of 
translations of the AtDMCl" COmParlSOn ° f conceptual 

Showed a very high degree o f aL ^ ^ 

<™>re than 80% identity, on t k° SSqUence homology 

exons and 13 i„ trons J rs pred^t °* ^ " 

in order to confirm the n , W " hin the AtDMCl gene. 

the boundaries ^J^.^^™" P-isely 

c^ed out as described .ProLa^ e^aT "f^. TT" 
1989) us ln9 total ^ i-< 1988; Frohman, 

Total RNA was prepared 7 *" 3 ten « >1 * t «- 

Prepared from Arabidopsis o.i - x mm 
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floral buds and young leaves by a standard method 
(Harpster et al . , 1988) or by method described by Napoli et 
al., (1990). Both methods produced good quality RNA, but 
the last one was preferred when small quantities of total 
RNA were required. 

For RT-PCR complementary DNA synthesis was carried out 
with 2 /ig of total RNA and primer MEI6U (5' ATC CTT CGC GTC 
AGC AAT GCC 3'). Second strand synthesis and PCR were 
carried out with primers 5RN (5' ATG CAG CTC GTT GAG CGT 
GAA 3') and MEI6U. PCR mixture was heated for 5 min at 
95oC, followed by 35 cycles of amplifications (94°C / 40 sec; 
56°C, 1 min; 72°C, 2 min) and 10 min final extension at 
72°C. 

According to the prediction, the primers should 
amplify 996 bp and 2 5 86 bp fragments from the cDNA and 
genomic DNA respectively. Two expected bands were observed 
after the electrophoretic separation of RT-PCR products and 
the 996 bp fragment was cloned and sequenced. The sequence 
confirmed the prediction, except that minor corrections 
were necessary to the predicted boundaries of some exons . 

Further steps were undertaken to identify the full 
length transcript, in particular the transcription start 
site. The GENEFINDER program (developed by S. Klostermann, 
Max-Planck Institute, Mart insried) predicted an additional 
short exon 5' to the first of the 14 exons that were 
initially identified. 

For identification of the transcription start site, 5' 
RACE complementary DNA synthesis was performed with 5 /ig of 
total RNA and primer 5R2 (5' TCA GCA GCT TCA CAG ATT TTG 
3'). Second strand cDNA synthesis and first PCR 
amplification were done with QI , QT (Frohman, 1989) and 
5R2N (5' TCA ACT TTG GCC TCA GAT AAA C 3') primers. 
Reamplif ication was done with QI and 5R2N1 (5' TTC TTG GTA 
TGC ATC ATG AGALACTOSYL IGG 3 ' ) primers . Several 5 ' RACE 
products were size-selected by gel electrophoresis, 
subcloned and sequenced as described above for genomic DNA. 
The beginning of the longest was presumed to correspond to 
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the transcriptional start site rt» 

This result confined the exis^en 3 - 
allowed us to define the or t actional exon and 

Th» Promoter of the AtDMCl oene 

The promoter contains some interest,- 
of the putative TATA bo*. Four ^r^""'" 
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repeats. Znterestin^" ^1^^^' " * 
short palindromic repeat ATGCAT at their B e / 0 " 3 " 
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recombination and resistance to ionising radiation, (48.5% 
and 70.6% of identity and similarity respectively) . AtDMCl, 
like other RecA-like DNA strand-exchange proteins 
(Kowalczykowski and Eggleston, 1994), possesses consensus 
ATP binding sites (Walker et al . , 1982). 

Comparison of the AtDMCl and the ArLIMlS (Sato et al . , 
1995) transcripts and genomic sequences confirmed positions 
for most of the exon/intron junctions, but the borders of 
intron 14 of ArLIM15 gene appears to have been determined 
incorrectly. As a result, two amino acids, alanine (A) and 
glutamic acid (E) (positions 326 and 327 for AtDMCl 
protein) were excluded from the predicted protein sequence 
of ArLIM15 (Sato et al . , 1995). It is unlikely that 
splicing for this particular intron is different between 
Columbia and Landsberg erecta ecotypes . There is amino acid 
substitution at the position 103 (leucine in Columbia, 
serine in Landsberg erecta ) . There is also a difference in 
the sizes of transcripts: AtDMCl transcription start site 
is 4 bp upstream of that for the ArLIMIS and the last exon 
is 7 5 bp longer. 

In situ hybridization analysis of AtDMCl expression 

In order to test whether or not the AtDMCl gene is 
expressed at the time of meiosis, in situ hybridisation 
analysis using cross-sections of the whole inflorescence 
and DIG-labelled antisense AtDMCl RNA as - a probe was 
carried out. Digoxigenin- labelling of RNA probes, tissue 
preparation and in situ hybridisation were performed as 
described by Bradley et al . (1993) and Coen et al . (1990) . 

The expression of AtDMCl gene in whorl 3 is restricted 
to pollen mother cells. No expression was detected in 
tapetum. In whorl 4 hybridisation signal was restricted to 
megaspore mother cells of ovules. DIG-labelled sense AtDMCl 
RNA was used as a negative control and did not reveal any 
hybridisation signal at the stages of flower development at 
which the expression of AtDMCl might take place. Weak 
nonspecific signal was detected in the mature pollen grains 
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£ R T°"T, hyb " di -"°" «"h sense and antisense AtDMCl 
RNAs (data not shown). No signals were 

post mel ot lc o TO les and developing e^ryos as „eH as in the 
analysed vegetative tissues. 

Characterization of expression of a P AtDMCl:GUS fusion 

The mexosis-associated expression of the AtDMCl led us 
to investigate whether the AtDMCl promoter could direct 
mexosis-associated expression of the GUS reporter gene A 
translational fusion was made between the putative AtDMCl 
promoter and coding sequences of the GUS reporter gene 
(see: Construction of pAtDMCl-.GUS fusion in Materials and 
Methods) . 

The fusion consists of a 3.3 kb DNA fragment 
containing the AtDMCl promoter fused in frame with GUS at 
the position of the methionine residue located in the 
second exon (Figures 2b and 3) . The sequence of the 3 3 kb 
DNA fragment of the AtDMCl gene used to drive meiosis- 
specxfic expression of a GUS reporter gene and its 
alignment with previously published sequence of the ArLIMIS 
gene are shown in Figure 3 . As a result the GUS protein 
carries 13 AtDMCl amino acid residues at its amino 
terminus. These residues appear to be neutral with respect 
to GUS activity. A schematic representation of the T-DNA 
region carrying the P AtDMCl:GUS fusion is shown in Figure 
2b . 



Eight primary transf ormants with this pAtDMCl-GUS 
fusxon were obtained and analysed for the presence of GUS 
expression patterns. Three primary transf ormants did not 
reveal any GUS activity, one showed ubiquitous GUS 
expressxon and four transf ormants revealed GUS expression 
patterns which were restricted to the whorl 3 and whorl 4 
of flowers. No GUS expression was detected in roots, leaves 
and stems of these four transf ormants , except that one of 
them exhibited GUS expression in damaged tissues. 

There was also X-Gluc staining in the receptacles of 
some of the open flowers, but this pattern is very common 
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for plants carrying the GUS gene and may be considered as 
non-specific (Klimyuk et al . , 1995). 

The GUS expression initially appears in the anthers of 
approximately 0.2 mm long inflorescence buds and later in 
5 the carpels of more advanced, approximately 1 mm long, 

buds. Meiosis in anthers and carpels does not coincide 
(Bowman, 1994) . The beginning of meiotic prophase I in 
anthers takes place at stage 9 of flower development, while 
the meiotic events in carpels do not occur until stage 11 

10 (Bowman, 1994) . Cross- sections of X-Gluc-stained buds at 

the different stages of development showed that GUS 
expression first appeared in pollen mother cells of anthers 
from inflorescence buds early in stage 9. In late stage 9 
GUS staining increased dramatically. GUS expression in 

15 ovaries was first detected at stage 11 of flower 

development. 

This temporal and spatial coincidence of reporter gene 
expression driven by the AtDMCl promoter with the stages of 
floral bud development corresponding to the time of meiosis 

20 provides indication that AtDMCl promoter can drive meiosis - 

associated GUS expression. The presence of residual GUS 
activity adjacent to the main sites of localisation in 
anthers and ovaries is the result of artefactual indigo 
blue dye formation in surrounding tissue. The specificity 

25 of dye localisation can be improved by using 0.2 - 1 mM 

potassium f errocyanide/f erricyanide in staining solution 
(Jefferson, 1987) . However, this we found compromised the 
sensitivity of the protocol resulting in loss of the weak 
GUS expression patterns at the early stages of meiosis. 

30 Inflorescences of nontransf ormed Arabidopsis plants were 

used as a negative control and they did not reveal any 
patterns of X-Gluc staining. 

AtDMCl and ArLIMl 5 genes are different within the promoter 
35 regions 

Alignment of the AtDMCl and ArLIM15 (Sato et al . , 
1995) gene sequences revealed that these genes, isolated 
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10 



from ecotypes Landsberg erecta and Columbia respectively 
are virtually identical throughout their respective coding 
regions, but there are significant differences in the 5' 
sequences. The sequences diverge 219 bp upstream of the 
transcrxptional start site of AtDMCl gene. 

Different combinations of primers ATM2 (5' GCA ACT GAA 
TTT GTT TTC GTT TG 3 > ) , ATM1 (5' TTG ATT AGT GGA TCC GCA 
AAC AA 3') and AR2 (5' TAG ATG AAA CGA GTT TGA CAC ATG 3') 
were used for PGR amplification of genomic DNA, isolated 
from Landsberg erecta and Columbia ecotypes of A. thaliana. 
The PCR amplification was performed as it described above 
for isolation of genomic clones, except that each primer 
concentration was 0.1 [xM and cycling conditions were- 94°C 
for 20 sec; 58°C for 20 sec; 72 C for 2 min; 35 cycles, 
15 followed hy a 10 min extension at 72°C. Primers Tl (5' GGG 

AAT GTT CCA ATA TAA G 3 ' ) and T2 (5' GAG AAT ATT ACA CTC 
TTA AA 3') were used for amplification of Limpetl 
sequences. The set of primers AR2 - ATM2 produced the 
expected 446 bp PCR band for Columbia ecotype, but nothing 
for Landsberg erecta, whereas primers ATM1 - ATM2 produced 
the expected 342 bp PCR product for Landsberg erecta and 
2.2 kb PCR product from Columbia ecotype. The positions and 
orientations of primers are shown in Figure 8. 

This result confirmed that the AtDMCl and ArLIM15 
sequences described above are not derived from chimeric 
clones and reflect genuine differences in gene structure. 
To determine the nature of the rearrangement, sequence of 
the ArLIMlB promoter region was extended. This indicated 
that the difference between the AtDMCl and" ArLIMIS promoter 
regions was caused by a 1874 bp DNA fragment present in the 
ArLIM15 gene but absent in the AtDMCl (Figure 8) . The 
fragment is flanked by two 9 bp direct repeats and has 26 
bp imperfect inverted repeats, with 73% of identity, at its 
termini; internal to these are two shorter inverted repeats 
(GenBank accession number U76697) . The general structure of 
the fragment suggests that it is a transposable element and 
that it is probably a member of the class of transposons 
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that exhibit DNA-mediated transposition. The two 9 bp 
direct repeats at the ends of the element probably 
represent a target site duplication. Consistent with this 
suggestion such a duplication is absent within the AtDMCl 
promoter . 

The element, which we have designated Limpet 1 (LIM 
promoter entrenched transposon- like element) , was used as 
a probe to blots of genomic DNA from three different 
ecotypes of Arabidopsis thaliana. Primers Tl and T2 were 
used to produce Limpetl hybridisation probe by 
reamplif ication of the ATM1 - ATM2 PCR product from ecotype 
Columbia. DNA gel blot analysis indicated that Columbia 
ecotype contains Limpetl and at least two additional 
closely related elements, but no Limpetl or related 
sequences were found in Landsberg erecta . 

In order to assess whether or not the insertion of 
Limpetl affects the expression pattern of ArLIMIS , relative 
to the pattern exhibited by AtDMCl, RT-PCR was performed 
with MEI1U and MEI4U primers, using equal amount of total 
RNA isolated from floral buds and leaves of both ecotypes. 
The results of DNA gel blot analysis of RT-PCR products 
suggest that two ecotypes exhibit very similar and 
relatively high levels of expression in inflorescence 
tissue and similar but much lower levels of expression in 
the leaves. 

Isolation of barley and tomato DMC1 homologues 

Degenerate primers MEI1, MEI4 and MEI5, which proved 
to be efficient for isolation of AtDMCl gene, were used to 
generate gene-specific probes for barley and tomato DMC1 
homologues designated as HvDMCl and LeDMCl, respectively. 
The first amplification was performed with MEI1 - MEI5 
primers at the conditions described above for the isolation 
of the AtDMCl gene-specific probe. The first PCR product 
was 10 times diluted with sterile distilled water and 1/il 
of it was used as a template for re-amplification with MEI1 
- MEI4 set of primers. Agarose gel electrophoresis of PCR 
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products showed single major DNA band of approximately 700 
and 800 bp long for barley and tomato, respectively. Gel- 
purified DNA bands were blunt-ended, cloned into the EcoRV 
site of P BS(KS + ) and sequenced. Comparative analysis of the 
clones revealed that they encode amino acid sequences with 
more than 90% of identity to the homologous parts of LIM15 
and AtDMCl proteins. These clones encoding part of the 
LeDMCl and HvDMCl genes, were used to screen the tomato 
cosmid (Dixon et al . , 1996) and barley lambda (Stratagene) 
genomic libraries, respectively. Partial sequences of 
genomic clones recovered from the screens are shown in 
Figure 5. Database similarity search using BlastX program 
(Altshulet al., 1990) helped to identify the translation 
start codon of HvDMCl gene. Only HvDMCl genomic clone was 
sequenced far enough upstream of the coding region to 
confer at least 1.5 kb of the putative promoter sequences 
(Figure 5a) . 

Success in isolation of DMC1 homologues from 
Arabidopsis, barley and tomato using set of MEIl, MEI4 
and MEI5 degenerate primers is a convincing example that 
these primers can be used to isolate DMC1 homologues from 
monocot and dicot plant species. Comparative analysis of 
the coding regions of plant DMC1 homologues including 
soybean DMC1 homologue cDNA ( Database accession number 
U66836, direct submission 13 August 1996) and recently 
isolated partial sequence of rice LIMl-5-like gene 
(Database accession number U85613, direct submission 16 
January 1997) demonstrated that they are at least 70% 
identical. The most variable part of the coding sequences 
is located at the 5' end of cDNA and encompass 
approximately 80 bp from the translation start site. It 
also revealed that plant DMC1 homologues have conserved 
exon/intron structure, which explains the success of 
using MEIl, MEI2 and MEI4 primers in different species. 

EXAMPLE II 

Construction of pAtDMCl zbarnase fusion for meiotic cells 
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ablation . 

The plasmid SLJ7744, carrying pAtDMCl : GUS 3'ocs 
fusion in pBS(KS + ) vector, was digested with Ncol and 
EcoRV, large fragment was gel -purified and ligated with 
'5 Ncol - EcoRV fragment of pJB142 carrying barnase-barstar- 

CaMV polyA fusion. The final construct was Xbal digested 
and 6 kb gel-purified fragment, carrying pAtDMCl : barnase- 
barstar-CaMV polyA signal, was cloned into the Xbal site 
of the binary vector SLJ755I5. The final construct 
10 SLJ11332 (Figure 6a) was mobilised into Agrobacterium 

tumefaciens C58C1 strain and used in transformation 
experiments . 

Construction of vector carrying AtDMCl cDNA fused to 

15 potato virus X (PVX) amplicon 

The first 450 bp of the AtDMCl cDNA were amplified 
with Clal/1 (5 ' - CAAAATTCTATCGATCTCACTCTTCCAAGCTTA- 3 ' ) 
and Clal/2 (5'- CAAAAGCCTCTGTGATCGATGAGGTTTCAATTCCACC - 
3') primers with introduced Clal sites (shown in bold). 

20 PCR fragment was digested with Clal, gel -purified and 

cloned into the Clal site of binary vector pVDH401 
carrying PVX amplicon (Angell & Baulcombe, 1997) . The 
final construct SLJ112315 (Figure 6b) was mobilised into 
Agrobacterium strain and used in transformation 

25 experiments as it described in Materials and Methods . 

It was shown that amplicon-mediated gene silencing 
can be used as an efficient tool to suppress endogenous 
RNA sharing homology with the transgene (Angell & 
Baulcombe, 1997) . Considering that yeast dmcl mutant 

30 fails to form normal synaptonemal complex and, as a 

result, produces a very low percentage of viable spores 
(Bishop et al . , 1992), amplicon-mediated silencing of the 
AtDMCl expression can be an alternative way to switch off 
the mechanism of sexual reproduction in Arabidopsis. 

35 Coding sequences of Arabidopsis, tomato, barley and rice 

(Database accession number U85613, direct submission 16 
January 1997) homologues share 70 - 80% of homology. 
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Considering this, it is possible that amplicon- based 
AtDMCl cDNA can efficiently silence DMC1 homologues 
expression in other plant species. Alternatively, host 
DMC1 homologue cDNA can be used in amplicon to switch off 
the mechanism of sexual reproduction and to achieve 
apomictic seed production. 

The remarkable specificity of the AtDMCl which 
confers tight developmental regulation of reporter gene 
expression in whorl 3 and whorl 4, may serve as a model 
system to study the mechanism of such regulation. Four 
direct repeats identified upstream of AtDMCl TATA box 
(Figure 3) may play an important role in developmental 
regulation of AtDMCl gene expression. Functional 
dissection of the promoter and site-directed mutagenesis 
may help to identify cis- regulatory sequences controlling 
the transcription of the AtDMCl gene and to clarify 
possible involvement of direct repeats in such 
regulation. Such sequences may be used as "baits" in one- 
hybrid system. 

Identification of the minimal region of the AtDMCl 
promoter possessing all the sequences necessary to drive 
meiosis-specif ic transcription can be achieved by 
preparing constructs carrying truncated AtDMCl promoter 
fused to the GUS reporter gene, transforming them into 
Arabidopsis and testing them for the ability to confer 
meiosis-specif ic GUS expression. These experiments will 
produce a truncated meiosis-specif ic promoter or even 
meiosis-specif ic enhancer sequences which will be more 
suitable for making constructs shorter than its present 
3 kb long version. Meiosis-specif ic enhancers may be 
fused to any heterologous "minimal" promoter, for 
example, with -67 bp 35S promoter, which is not able to 
drive transcription, but contains RNA polymerase binding 
site . 

The present inventors have made constructs carrying 
different types of fusion of the AtDMCl promoter to 
different transposase genes (Ac, Spm) as well as to the 
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Tntl tobacco retrotransposon. 

In the case of Tntl retrotransposon, part of 5 ' LTR 
upstream of retrotransposon TATA box was replaced with 
the AtDMCl promoter sequences located upstream of the 
5 AtDMCl TATA box. Transcriptional fusion within 

nontranslated leader of AtDMCl gene was made for Spm 
transposase and pAtDMCl : lOATG.Ac transposase 

translational fusion was performed within the second exon 
of AtDMCl gene, as it described for pAtDMCl:GUS construct 

10 (Figure 2B) . 

Isolation of the promoters of DMC1 homologues. from 
other plant species and testing their abilities as well 
as the ability of pAtDMCl to drive meiosis - specif ic 
transcription in heterologous plant systems can easily be 

15 envisaged. 
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Claims 



1 . A nucleic acid molecule comprising 

(a) the Arabidopis meiosis-specif ic - DMC1 
5 (AtDMCl) gene promoter; or 

(b) a meiosis-specif ic promoter homologous to 
(a) but from another plant species; or 

(c) a meiosis-specif ic promoter of the gene of 
a homologous DMC1 protein from another plant species; or 

10 (d) a meiosis-specif ic promoter variant, 

mutant, allele or derivative of (a), (b) or (c) ; or 

(e) a portion of (a) , (b) , (c) or (d) 
sufficient to confer meiosis-specif ic character to a 
promoter containing it. 

15 

2 . A nucleic acid molecule according to claim 1 wherein 
the homologous DMC1 protein is encoded by a nucleic acid 
sequence having at least 55% homology with the nucleic 
acid sequence of AtDMCl. 

20 

3. A promoter comprising at least a portion of a 
nucleic acid molecule according to claim 1 or claim 2 
sufficient to confer meiosis - specif ic character to the 
promoter . 

25 

4. A promoter according to claim 3 comprising all or 
part of the sequence shown in Fig. 4(A) . 



5 . A promoter according to claim 4 wherein the sequence 
30 is derived from nucleic acid which lies 5' of nucleotide 

4473 in the sequence shown in Fig. 4(A) . 

6. A nucleic acid construct comprising a meiosis- 
specif ic promoter according to any one of the preceding 

35 claims operably linked to a heterologous nucleic acid 

sequence . 
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7. A nucleic acid construct according to claim 6 
wherein the heterologous nucleic acid sequence is a gene. 

8. A nucleic acid construct according to claim 7 
wherein the heterologous gene encodes a cytotoxic 
protein. 

9. A nucleic acid vector comprising a promoter or 
nucleic acid construct according to any one of the 
preceding claims. 

10. A recombinant host cell containing a promoter 
according to any one of claims 1 to 5, or a nucleic acid 
construct according to any one of claims 6 to 8 or a 
nucleic acid vector according to claim 9, optionally 
integrated into the genome of the host cell. 

11. A host cell according to claim 10 being a plant 
cell . 

12. A method of producing a plant cell of claim 11 
comprising the steps of introducing a nucleic acid 
comprising a promoter, construct or vector into a plant 
cell; and causing or allowing recombination between the 
nucleic acid and the plant cell genome to introduce the 
sequence of said nucleic acid into the genome. 

13. Use of a promoter, construct or vector according to 
any one of claims 1 to 9 in the production of a 
transgenic plant. 

14. A plant cell comprising in its genome a promoter, 
construct or vector according to any one of claims 1 to 
9. 



or 



15. A plant, part or propagule thereof, seed, selfed 
hybrid progeny or descendant thereof comprising a plant 
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cell according to claim 16. 

16. A method of inducing meiosis-specif ic expression of 
a nucleic acid sequence in a transgenic plant cell 
comprising the steps of 

(a) transforming a plant cell with a nucleic 
acid comprising a promoter, construct or vector according 
to any one of claims 1 to 9 so as to introduce said 
nucleic acid into the genome of said plant cell so that 
expression of a nucleic acid sequence is regulated by 
meiosis-specif ic promoter; and optionally 

(b) regenerating a plant from said plant cell. 

17 . A method of producing a sterile plant comprising the 
steps of modifying plant cells by incorporating into 
their genome nucleic acid comprising a modification 
nucleic acid sequence and a promoter, construct or vector 
according to any one of claims 1 to 9 so that said 
modification nucleic acid sequence is expressed in said 
meiotic cells under the control of the meiosis - specif ic 
promoter, thereby altering the meiotic cell cycle and 
rendering the plant sterile. 

18 . A method according to claim 17 wherein the construct 
further comprises an insertion sequence flanked by target 
sequences and positioned in between said nucleic acid 
molecule and said modification nucleic acid sequence, 
said target sequences being under the control of an 
inducible promoter such that on induction the insertion 
sequence is eliminated. 

19. A method according to claim 17 or 18 wherein the 
target sequences are lox recombination sites which are 
activated by ere recombinase under the control of the 
inducible promoter . 

20. A method according to any one of claims .17 to 19 
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wherein the modification nucleic acid sequence encodes a 
cytotoxic protein which leads to the ablation of meiosis 
cells . 

21. A method of isolating DMC1 homologues comprising 
the steps of PCR on template nucleic acid using any one 
or more of the following degenerate primers 

MEI1- 5' GG(N) AA(GA) GT (N) GC (N) TA(CT) AT (ACT) GA 

3' ; 

MEI4 - 5' AC(N) GC(N) AC (GA) TT (GA) AA(CT) TC(CT) 
TC (N) GC 3'; 

MEI5 - 5' GC (GA) TG (N) GC (N) A (GA) N AC (GA) TG (N) 
CC(N) CC 3' ; and 

isolating said PCR product. 

22. A method according to claim 21 wherein the ME11 and 
ME15 or MEI1 and MEI4 primers are used in combination. 

23. A method according to claim 21 or claim 22 wherein 
the MEI1 primer is 5' GGA GGG AAT GGA AAA GTG 3' and the 
MEI4 is 5' GCA ACG TTG AAC TCC TCT GCA AT 3 ' . 

24. A nucleic acid molecule comprising any one of the 
following sequences for use as identification probes or 
PCR primers 

MEI1- 5' GG(N) AA (GA) GT (N) GC (N) TA(CT) AT (ACT) GA 

3' ; 

MEI4- 5 ' AC(N) GC(N) AC (GA) TT (GA) AA(CT) TC (CT) 
TC (N) GC 3 ' ; 

MEI5- 5' GC (GA) TG (N) GC (N) A (GA) N AC (GA) TG (N) 
CC(N) CC 3' ; 

MEI1U- 5' GGA GGG AAT GGA AAA GTG 3' ; and 
MEI4U- 5' GCA ACG TTG AAC TCC TCT GCA AT 3'. 



25 . 

of 

to 



A method of transposon tagging comprising the steps 
creating a construct comprising a promoter according 
any one of claims 1 to 5 operably linked to a 
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transposase required for transposition of a transposable 
element; transforming a plant cell with said construct 
such that the transposase is driven by said promoter; and 
determining transposition events. 

5 

26. A nucleic acid construct for selectively expressing 
a 

nucleic acid sequence of interest comprising 

a promoter according to any one of claims 1 to 5 
10 upstream of said nucleic acid sequence of interest; 

a nucleic acid insertion sequence flanked by target 
sequences and positioned in between said promoter and 
said nucleic acid sequence of interest; and 

an activating gene under the control of an inducible 
15 promoter such that on induction said activating gene 

expresses an activator which activates said target 
sequences thereby eliminating said insertion sequence 
allowing the nucleic acid of interest to be expressed 
under the control of the promoter. 

20 

27. A nucleic acid construct according to claim 26 
wherein the activating gene is a DNA recombinase gene. 

28. A nucleic acid construct according to claim 27 
25 wherein the activating gene is ere recombinase and the 

target sequences are lox recombination sites. 



29. A method for selectively expressing a nucleic acid 
sequence of interest in a plant comprising the steps of 
30 transforming said plant cell with a nucleic acid 

construct according to any one of claims 2 6 to 28; 

regenerating a plant from said plant cell; and 
inducing said inducible promoter. 

35 30. A method of producing seeds apomictically comprising 

the steps of modifying plant cells by incorporating into 
their genome a nucleic acid construct according to any 
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one of cla lms 26 to 28 so that said modification nucleic 
acid sequence is expressed in said miotic cells thereby 
eliminating sexually produced seeds in a plant 
regenerated therefrom. 

31. A method according to claim 30 wherein the construct 
is incorporated into the genome by breeding. 

32. A method according to claim 30 wherein the construct 
" lncor P°^ted into the genome by transformation. 

33. A method of detecting apomictic mutants comprising 
the steps of generating a plant or plant part carrying a 
nucleic acid construct according to any one of claims 26 
to 29; creating a variation for apomixis in accordance 
with a breeding programme wherein there is a natural 
variation for apomixis; deriving a suitable self progeny 
population; inducing said activating gene in the self- 
progeny an d detecting viable seeds produced 
apomictically. 

34. A method of detecting apomictic mutants comprising 
the steps of generating a plant or plant part carrying a 
nucleic acid construct according to any one of claims 2 6 
to 29; mutagenising said plant or plant part in 
accordance with a mutagenesis programme; - deriving a 
suitable self progeny population; inducing said 
activating gene in the self-progeny and detecting viable 
seeds produced apomictically. 

35. A nucleic acid construct for selectively removing 
one or more transgenes from the genome of a transgeneic 
plant comprising 

a promoter according to any one of claims 1 to 5 
operably linked to an activator gene for expressing an 
activator; and 

at least one target sequence flanking said transgene 
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and said target sequences being activated by said 
activator to eliminate said transgenes. 

36. A nucleic acid construct according to claim 35 
wherein the activating gene is a DNA recombinase gene. 

37. A nucleic acid construct according to claim 35 or 
claim 3 6 wherein the activating gene is ere recombinase 
and the target sequences are lox recombination sites. 

38. A method of selectively removing one or more 
transgenes from the genome of a transgenic plant 
comprising the steps of 

transforming a plant cell with a construct according 
to any one of claims 3 5 to 37; 

regenerating said transgenic plant from said plant 
cell; and 

recovering progeny from meiotic reproduction of said 
transgenic plant 
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FIGURE 1 
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FIGURE 2 
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FIGURE3. 

-440 

TAAAATTAATTTCATTAGTGGATCCGCAAACAAATATTAGAT TGG 

GCCTATATGCATC TATATTATTTTTATTTTTr^nTAATTTnAnTA 
-350 

AAA TCGGO:TATG GT (X:TATATGCAT CGGAATAATTA^TATArTC 

GGCTTA TGGGCCTATATGCAT TTGATTTTATCGATAAAATCTGAG 
-260 

TCAAATGTCTAATGTGCGCCGTTATGAAGTGCAAGTGGCTAATTT 

TTTTCACCTAGATTCCTTCTATTGACCGTCGATAGACGGATGATA 

-170 

ACTATGACGTGGCATTATCGCAGCXTATCAAACAAAGTCATGTATA 

ACAAACAAGAGCACACAAACGAAAACAAATTCAGTTC 
-80 

AAATTCAAATCAACGGAATTAGAATCACGCTTTCAATTCCGTAAC 



+ 1 



C C G CC^/TTAAAAAp C TTG AAC CC TCG AAG C AAATGGAG C AAAG AT 
+ 11 

TTTCAAATTTCGAATTTCAAAATTCTATCTCTCTCACTCTTCCAA 

GCTTAGAGAGTCTTAGAGCGAGAAAATGATGGCTTCTCTTAA GTA 
+ 101 M M A S L K 

AGTGATTGATCTCTCTCTTTCTCTCTACTACGATTCTTCTTCTTC 

TTCTCCATTCATCGTTTTGGTTTAAGCTTTGTCTTAAGTTTTGTG 
+ 191 

TACCTGACTCGCTTCTTCTOoTTTTTATTTTGTTTTCCGATGATC 

CTGATCTGTTTGTGTTGTTTCGGATTC ATAGA GCTGAAGAAACGA 
+ 281 A E E T S 

GCCAGATQCAGCTCGTTGAGCGTGAAGAAAATGATGAAGACGAAG 
+ 326 Q^QLVEREENDEDED 

ATCTATTTGAGATGATTGACAAA TGTA AGATTTGA 
L F E M I D K L 
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FIGURE 4 (A) 



GTTAACACCGTTTATATGAGACAAAATCAGCTATGAGATTACTCGTGTATCAATrCTCTA 

1612 + + + + + + - 1671 

CAATTGTGGCAAATATACTCTGTTTTAGTCGATACTCTAATGAGCACAl'AGTTAAGAGAT 

ATTAATTAAAAATAGTATAAATTAAATAATATAGTTCGATACACGAATATAATTGCGAAG - 
1672 + + + + + + . 1731 

TAATTAATTTTTATCATATTTAATTTATTAT^ 

AATAGGCATACAAATTTGTCATACATGTTTCGATAT^ 
1732 + + + + + + - 1791 

TTATCCGTATGTTTAAACAGTATGTACAAAGCTATACCGAGTGCTCCTCCGACTACGTTG 

AGTTTGATGTATACGTATGCAAATTGAGAAGTACTTGATCAGACCTATATATGTGATGCT 
179 2 + + + + + + _ 1851 

TCAAACTACATATC^ATACGTTTAACTCTTCATGAACTAGTCTGGATATATACACTACGA 

CGAACTTATCTTTTTGTTTTGGATCATCTATCGAATACAAIX^ 

1852 + + + 4- + + - 1911 

GCTTGAATAGAAAAACAAAACCTAGTAGATAGCTTATGTTACCATGATATTAAATTTACA 

TTTTTTTCTTCTTTTTCTTTAGTATCAAAAGC 

1912 + 4- + + Hr + - 1971 

AAAAAAAGAAGAAAAAGAAATCATAGTTTTCGTTGCAATCTACGATTTATTTC 

ATTGTG ATG ACTG AT AGTC TG AT AAT ATC ATT AACTTTGC AC C CG AAGTC AAAT AAAA 

1972 + - + + + + . 2031 

ACTAACACTACTGACTATCAGACTATTATAGTAATTGAAACGTGGGCTTCAGTTO 

GTGTTGATATTTATAAATTCCAACGAACGTTAATA^ 

2032 + + + + + + - 2091 

CACAAGTATAAATATTTAAGGTTGGTTGCAATTATTCGGTGTGGAT^ 

ACAATATTATAATAAAATTAAAAAAACTACGACTAAAGTTAATTTGC 

2092 + + + + + h — 2151 

TG TT ATAATATT ATTTTAATTTTTTTGATGC TGATTTCAATTAAACGATATT AAAACACC 

TATGTTTTAAAAATAAAGTTCTTTAGTTCTAATATCATGA 

2152 + + + + + 4-- 2211 

ATACAAAATTTTTATTTCAAGAAATCAAGATTATAGTACTTTTAAGTC 

TATGTAAAAAGGTTTTAGTACAATTCTTTTTTGTATATAACG 

2212 + + + + + +- 2271 

AT AC ATTTTTCC AAAATC ATGTT AAG AAAAAACATATATTGC CG TTTCAAGTT ATGTATA 

TTTACTATTG ATTTTTTTTAAAAAT AAAATAACAAl^ CAACTTTTTG AAGC ATAT 

2272 + + + 4- + + - 2331 

AAATGATAACTAAAAAAAATTTTTATTTTATTGTTAA 

TGATCGCAACTTAATTATAATTCTTCTITTTTTTCTTGGA 

2332 + + 4- + + +- 2391 

ACTAGCGTTGAATTAATATTAAGAAGAAAAAAAAGAACCTTCTAAT^ 

CAATGTGGAACAAATAAATGTAGAAATATTGTTATCACAAACTAATATATGATA'I"l v l , 'I*'l'r 

2392 + 4- 4- + + +- 2451 

GTTACACCTTGTTTATTTACATCTTTATAACAAT 

AATATTTTCATATATACTTTTGAGCTTCTGATGATATAACAGTTTTCATTAA 

2452 + + +- + + + - 2511 

TTATAAAAGTATATATGAAAACTCGAAGACTACTATATTGTCAA^ 

TTGTCGTGTACTAATTTTTCTTTTGTTC 

2512 + 4- 4 4 4- + - '2571 

AACAGCACATGATTAAAAAGAAAACAAGTTCATACACTATTTTTATACAACGTTTTAACG 
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FIGURE4 (A) continued 

GAGTTATTATAATGGTACAAATATGTAGAGAGAATACATGAGAAGAGTTAAAAGAAGCAT 
2572 4 + + + + + _ 2631 

CTCAATAATATTACCATGTTTATACATCTCTCTTATGTACTCTTCTCAAT^^ 



GCTTAAGCCAACAGAGAGTGGATCCAAATGTTGCTTTCCAGCTTTATACAAA 

2632 + + + + + + _ 2691 

CG AATTCGGTTG TCTC TC AC C T AGGTTT AC AACG AAAGGTCG AAATATGTTTGC ATAG TG 



CCACATTACTCCCACTGCTACATATATTGAAGGAGAGAGAGATGATGATCATAATGATAA 
+ + + + + + _ 

GGTGTAAT<^CGGTCACGATGTATATAACTTC 



ATCGATGTCGATGATAAATTGATGATGATGGCTCCGGTATGTGTACCCTAGGAGTTGTAG 
2752 4- + + + + + _ 2811 

TAGCTACAGCTACTATTTAACTACTACTACCGAGGCCATACACATGGGATCCTCAACATC 

CTAGCTAGCTAGGACCA1^3TATATACATACATACATATATTAGTGTTTTTTGTAACTTGT 
2812 + + + + + + _ 2871 

GATCGATCGATCCTGGTACATATATGTATGTATGTATATAATCACAAAAAACATTGAACA 

ACGTACCTTACAAACAGTATGGAGTTTACTAAAACGGCAACGTTTGGTGGGGGTAGTGAA 

2872 + + + ■+■ + + - 2931 

TGCATGGAATGTTTGTCATACCTCAAATGATTTTGCCGTTGCAAACC 

TTCGCAAGTGGGGATGAGTCTATGTAATAGAAGATGCAACATGCAAATGGTCCCCTTTCT 
2932 + + + + + + _ 2991 

AAGCGTTCACCCCTACTCAGATACATTATCTTCTACGTTGTACGTTTACCAGGGGAAAGA 

GTTTTTATTTAAAGAAATTAGTGTTTACTGAGGAGGAAACATC 

2992 + + + + + + _ 3051 

C AAAAAT AAATTT CTTT AATC AC AAATG ACT C CTCCTTTGT AGGG T AAAT ATC T AAGTG T 

GCCATAAAAC<LAAACCACTTCTCCTTCTTTTT 

3052 4- + + + + +- 3111 

GGGTATTTTCGTTTGGTGAAGAGGAAGAAAAATAAAGGGGTACT 

TTTTGAAAATTTGAAGTGTACATTTAGAGATT^ 
3H2 + + + + + + _ 3171 

aaaacttttaaacttcacatgtaaat<^taag^ 
atctaaatatataaactccaatttaaaataat^ 

3172 4- + + + + + . 3231 

tagatttatatatttgaggttaaatt^ 
gccat/ttxsttggtcttcatt^ 

3232 + + + + + 3291 

CGGTAAACAACCAGAAGTAAAGAGTACGAAACTAATGT^ 

AAATCATCTGCATAAACTAAGAAATAGCTAC^ACATAAAATTT^ 

3292 + + + + + + - 3351 

TTTAGTACACGTATTTGATTCTTTATCGATC^ 

TACTATGTTCACTTTAAGAGAAAAAAAAACH^ATGGCAAAA^ 

3352 + + 4- 4- + + - 3411 

ATGATACAAGTGAAATTCTCTTTTTTTT^ 

TATCATAATCAAAGTGCATAT<nX3AAGTCAGAGGCAACTC 
3412 + + + + + + _ 3471 

ATACTATTAGTTTCACGTATACACTTCACTCTCCGT^ 



C AAAGAAAATTTTTAAATATG AG AAAAAATTATATAAAAAC^^ 

t 4- A H + 4 - 

GTTTCTTT/TAAAAATTTATACTCTTT^ 



CTTTTK3ATATAGGGAGATTCGTTGAGCATCC 
3532 + 4 + 4 4 4- 3591 

GAAAACTATATCCCTCTAAGCAACTCGTAGGTACAOSAGA 
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FIGURE4 (A) continued 



CTGTATCTAGCCAACCCACATATACCTTTACACTAGAGAACTTCGATGATTC 

3592 + + + + + + - 3651 

GACATAGATCGGTTGGGTGTATATGGAAATGTGATCTCTTGAAGCTACTAAGAAAAAGG'I , 

AAATCAATGTGATATAATATAATTAAGCATATATGCATAAAAAATGAAGAAGAATGGTAG 

3652 + + + +-- + + - 3711 

TTTAGTTACACTATATTATATTAATTCGTAT^ 

AGTCATGTTACTTAAGGTCATGGTGTGTAAAAACATTGATACTTTACAATAT^ 

3712 + + + + + +- 3771 

TCAGTACAATCAATTCCAGTACCACACATTTTTGTAACTATGAAATGTTATATACTCAAC 

TGAAGTGCTCTTAAAGTTATAACATCCGGTTCTACGTATTGACCTAGAACTAGAAGAATC 

3772 + + + + + + - 3831 

ACTTCACGAGAATTTCAATATTGTAGGCCAAGATGCATAACrGGATCTTGATCTTCTTAG 

3832 + + + + + + - 3891 

CAAAAAATCAGGTTTAGTTTAGTTCAGCCAAGAAATAGTCAAAACAACATACACTTAATT 

TTTGAAAATATTAGCTATGATCTTAGCT^ 

3892 + + + + + + - 3951 

AAACTTTTAT AATCG ATACTAGAATCG AACCC AAAAAC AAAG ATTCC C AATTC CTAGTAT 

TCTCTTTGTCAAATGACATGTGGTCTATATC 

3952 + + + ■+- + +- 4011 

AGAGAAACAGTTTACTGTACACCAGATATACAGTACTTAATCCGTGGCGATAGAAAATGA 

ATTGATTCGACGACATTGGGACTCCTCACTACACTTATCTTA 

4012 + + + + + + - 4071 

TAACTAAGCTGCTGTAACCCTGAGGAGTGATCTGAATAGAATTTT 

GTTAATGGCTTGTCACCATAAACTTTCATGAGCTCT^ 

4072 + + + + + -t— 4131 

CAATTACCGAACAGTGGTATTTGAAAGTACTCGAGATTCT 

AGGTCTCACAATATATACAATTTCGAGGGATAAATATTTCAAAAGGATAATATGATAGTT 

4132 + + + + + + - 4191 

TC C AG AG TGTTAT AT ATGTT AAAGCTCCC T ATTT ATAAAGTTTTCCTATTAT ACTATC AA 

GGTAGAAATGTATAGTTTCTAGTAATAATAGAGATC 

4192 + + + + + +- 4251 

CC ATC TTT AC AT ATC AAAGATCATT ATT ATC TCT AGC AAC CAATTTG AGGGGTTG AAAAA 

AAAATTAATTTGATTAGTGGATCCGCAAACAA^ 

4252 + + + + + +- 4311 

TTTTAATTAAACTAATCACCTAGGCGTTTGTTTATAATCT 

ATTATTTTTATTTTTGTGTAATTTCAGTAA 

4312 + + + + + +- 4371 

TAATAAAAATAAAAAGACATTAAAGTCATTTTACCCGGATACCAGGATATACGTAGGCTT 

TAATTAGTATACTGGGCTTATGGGCCTATATGCATTTGATTTTATCGA 

4372 + + + + + + _ 4431 

ATTAATCATATGACCCGAATACCCGGATATACGTAAACTAAAATAGCTATTTTACACTCA 

CAAATGTCTAATGTGCGCCGTTATGAAGTGCAAGTGGCTAATTTTT^ 

4432 + + + + +■ + - 4491 

GTTTACAGATTACACGCGGCAATACTTCACGTTCACC 

TTCTATTGACCGTCGATAGACGGATGATAACTATGACGTGGCATTATCGCAGCCATCAAA 

4492 + + + + + + -'4551 

AAGATAACTGGCAGCTATCTGCCTACTATTGATACTGCACCGTAATAGCGTCGGTAGTTT 
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FIGURE4 (A) continued 

CAAAGTCATGTATAACAAACAAGAGCACACAAACGAAAACAAAT-rCAGTTGCGGAACCCA 
4552 + + + + + + _ 4611 

GTTTCAGTACATATTGTTTGTTCTCGTGTGTTTGCTTT^ 

AATTCAAATCAACGGAATTAGAATCACGCTTTCAATTCCGTAACCCGCCATTAAAAACCT 
4612 + + + + + + _ 4671 

ttaagtttagttgccttaatcttagtgc^ 

TGAACCCTCGAAGCAAATCGAGCAAAGATTTTCAAATTTCGAATTTC 
4672 + + + + + + _ 4731 

ACTTGGGAGCTTCGTTTAGCTCGTTTCTAAAAGTT^ 



TOTCACTCTTCCAAGCTTAGAGAGTCTTAGAGCGAGAAAATGATGGCT 
4732 + + + + 4779 

AGAGTGAGAAGGTTCGAATCTCTCAGAATCTCGCTCTTTTACTACCGA 

FIGURE4 (B) 

4473 TTTTTTTCACCTAGATTCCTTCTATTGACCGTCGATAGACGGATGATAAC 4522 

Nil ! IIMMIIM Mllllli IIIIMi [Mill 1 1 IMIIIIMI M 

1120 tttttttcacctagattccttctattgaccgtcgatagacggatgataac 1169 
4 523 TATGACGTGGCATTATCGCAGCCATCAAACAAAGTCATGTATAACAAACA 4572 

, NIIIIIIIIIIMIIIIIIIMIIIIIIilllliiiiMMMIIII I 

1170 tatgacgtggcattatcgcagccatcaaacaaagtcatgtataacaaaga 1219 
4 573 AGAGCACACAAACGAAAACAAATTCAGTTGCGGAACCCAAATTCAAATCA 4622 

MIMIMIIIIIIIMIIIIMMIIIIIIIMIiMllilllllllll 

1220 agagcacacaaacgaaaacaaattcagttgcggaacccaaattcaaatca 1269 
4 623 AC . GGAATTAGAATCACGCTTTCAATTCCGTAACCCGCCATTAAAAACCT 4671 

o 11 IIIIIIIIIINilNMMMMMNIIIIIIIIIIIIIIMll! 

1270 acgggaattagaatcacgctttcaattccgtaacccgccattaaaaacct 1319 

4672 TGAACCCTCGAAGCAAATC 4690 

I ill I I MIIIIM II li'l 
13 20 tgaaccctcgaagcaaatc 1338 
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GAACATAAAATTTAAAAAATGACTCTTGGGGATAGCAAAAATACATTTCGGGAATGATCG 



I + + + 4- + + 60 

CTTGT ATTTT AAATTTTTT ACTG AG AAC CCCT ATCGTTTTT ATG T AAAGCCCTT ACT AGC 

TGTTCGTTTTCTCATGCGATATGACAGTAAAAAGGATAAAAATA 

61 + + + + + + 120 

ACAAGCAAAAGAGTACGCTATACTGTCATTTTTCCTATTTT^ 

GTTGTAGGTGTGTGAACATGATTATATATAACATTTATGAAAGCATCCACATATACAAAA 

X21 + + + + + + 180 

CAACATCCACACACTTGTACTAATATATATTGTAAATACTTTCGTAGGTGTATATGTTTT 

CAATGGCCGGACATTTATCCAAAGCACACAATCTGCACAGTCGTATAATAGCACACATCC 

181 + + + + + + 240 

GTTACCGGCCTGTAAATAGGTTTCGTGTGTTAGAC 

CAATGAGAAAATAAAAAGAAGGGAGGACATAAAAATCCACATGTGATAACACGCTCATGC 

24i + + + + + + 300 

GTTACTCTTTTATTTTTCTTCCCT 

TCCGCAAAAAAAAAGATAGCACACTGATGCAAAAACGAGAATCAACACACATGCCACACG 

301 + + + + + + 360 

AGGCXrrTl^iTTTTTTCTATCGTGTGAGTACGTTTTT^ 

AAAAAC AAT AATGC CTCTGAAGG AGAGC ACAAAACTTG AGGCGTTCC ATAC AGCAATG AA 

361 + + + + + + 420 

TTTTTGTTATTACGGAGACTTCCTCTCGTCTT^ 

AAACTTAGATGTGCATTAGATAATACTCATCTAAAATTATCATCATATA 

421 + + + +■ + + 480 

TTTGAATCTACACGTAATCTATTATGAGTAGATT^ 

TTAATATACCTTCTTTCGATCAAATAAATAAAATATACAAATAAAAGTTAAAAAGTTAAA 

481 +■ + + ■+• + + 540 

AATTATATGGAAGAAAGCTAGTTTATTTATTTO 

CTAAAGGACTTGAAAATGACACAAG^WW^TAAAATAGAAACAACAAAAAGACAAAACAA 

541 + + ■+- + + + 600 

GATTTCCTGAACTTTTACTGTGTTCTTTT^ 

CAATAATAGCATCGCTTAGCCGTTTGCACCIATGCACCCCTCCTCGA^ 

601 + + + + +■ + 660 

GTTATTATCGTAGCGAATCGGOWVCGTCGTAC 

CGCTCCAACAGATAACATCAAGACAACAAAATGAAAATGGGCTO 

661 + + + + + + 720 

GCGAGGTTGTCTATTGTAGTTCTCTTGT 

GCAGTATAGTGGAACTACCACAAATCTAAAAGTTGTATTGAACGAAGTAGGGCGTAATTG 

721 + + +- + + + 780 

CGTCATATCACCTTGATGGTGTTTAGATTTTCAACATAACTTG^ 

AGACTTTGTTAAATCTCAGTCAACTGAGATTCAATGCATCCATAAACC 

781 + + + + + + 840 

TCTGAAACAATTTAGAGTCAGTTGACTCTAAGTTACGTAG^ 

AACACTTACATAAATAGTTTTTATTGAAGCAATCATATATTT 

841 + + +- +- +- +■ 900 

TTGTGAATGTATTTATCAAAAATAACTTCGTTAGTATAT 

T AG AAATT AC AC AT ATG ATC ACCGT ACG AGAC AC AAATGC TT<TTG AAGC AACTTT^ 

901 + + +. + + + 960 

ATCTTTAATGTGTATACTAGTGGCATGCTCTGTGTTT 
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FIGURE 5(A) continued 



961 ^^^^^^"^^TCCTCACAGAAA^CATCCCTCGAO^ 

TTTTTTAAGTGTTCTTCATTTTGA'TCATA^GGTAGGA 1020 

TGAGCAGTCCACAAGCGTTTTTCACCGTCACCATGACAGCACCGGAAAAAAAAAG 



1021 + _™^ i ^ s - v -"^^AL^/\uc(J^AAAAAAAAAGCAATG 

ACTCGTCAGGTGTTCGCAAAAAG^TGGCAGTGGTA^^ 1080 

1081 ^.!-?^!f?!?!^^^ 

AGTATCGTICTAGAGCCTTTCGTGGTGGTGTCTCA'I^ACT 114 ° 

1141 ^^^^^CGAGAC CAGGTCITTC^ATOTAGGCCGGGGATCCTCCTCGTG 

CCAG ™CCATCGATGCTC^ 1200 

1201 f^f^^???^^^ 

^ATCAGCTTCTTCTCCTTGCAGTCG 



CAGTGGTCTAGAGCTACCTCAATTGAAGTAAGCTAC" — + + 1260 



1261 

CGGGGCAAGTTTTAAAACATAAAGC7TACCCGGATCGACCGT " 20 

1321 ™!!!^If^^ 

GAACCACAATGCTAGGTGTTCG'TCTrcA^GGAGCGCCCCCG 1380 

1381 f??!!?^^^ 

C^GTCGGAGTCACTAGGTG^^A-c^G^^;^-^^^^ 1440 
1441 ^CCAACTCC^CAAGC CGCACGCGAGCCTCCGCCTCGTrACAGGTCTCGTCCTCC 

GGGGTTGAAGGAAGAGGTTCGGCGTGCGCT^GGAGGCGGAG 1500 
1501 ™--.^^^CC^ 

**«»^^ 1560 

is6i f!^f?^??f!?!?^^ 

1620 

1621 

wck**™^^ 1680 

1681 ^^^ACAAG^ACGTC^ 

AGCTCAGGTAACTGTTCACATGCAAGCTGCGGAGGATGGGGA " 4 ° 
1741 -ffffffff^f^ 

1800 

1801 ^T!?^!???? 0 ^^ 

\GCGCGCAAAGCCAACGTTAAAGC 
1861 , ~ 1 1 ^ J-J'AGCCTXTCACATTTGGTTGGTTTTT 



gcggcaacgcaaggcccacggaggcgga'gaga'^ a ^c" + + + 1860 



TGCTGTTTGGAGGTAGACTTCGTGCGGATTTTGCTTAG 

+ + + ♦ 1920 

ACGAC ^CTCCATCTGAACCACGC CT 
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FIGURES (A) continued 

GGTGCGTGCAGCGTAGTCGCGTGCTCGTGTTGCGTTGGTTTC 

1921 + + + + + + 1980 

CCACGCACGTCGCATCAGCGCACGAGCACAACGCAACCAAAGCCAAAAGAGGGGATAAAG 

GTGC AATCTGGGG AGC AACT AGCGCGCTTTGGT AGC CT AATTTTGTGC AATCC ACGG ATT 

1981 + +-. + + + + 2040 

CACGTTAGACCCCTCGTTGATCGCGCGAAACCATCGGATTAAAACACGTTAGGTGCCTAA 

AGTCTGAACTTAGCCCCATTTTGTGCCAA 

2041 + + + + + + 2100 

TC AG ACTTG AATCGGGGT AAAAC ACX^TTGT ACGC AAATT AGGC C GC AGCGCT AGTG ACG 

CTGT AG AATC CTTTC G AATTCC AAACT AG ATATAGTTC AATCGC ATGTGTTTTCTG TGG A 

2101 + + + + + + 2160 

GACATCTTAGGAAAGCTTAAGGTTTGATCTATATC^AGTTAGCGTACACAAAAGACACCT 

AACTTTGACCAATTACGAACGCGTTTTCGTGCACTTCGAGGCGTAGGGTTTTA 

j ioJ + + + + + + 2220 

TTGAAACTGGTTAATGCTTGCGCAAAAGCACGTGAAGCTCC 

: ^ACTCATCTCTTCXTACATTTTTTTCTC 

j + + + + + + 2280 

GAATGAGTAGACAACGTGTAAAAAAAGACTTACAAGTAGACAAGACAAGGTATTAATGCC 

GTAGCTTCTGTGAGCTTAGCAAATTTXSATTTA 

22t;j + + + + + + 2340 

CATCGAAGACACT^GAATCGTTTAAACTAAATTTGTAAACGGGATCTACACTAGGGCGCG 

CTGTGTTTTGTTTGCGTGATCCAGTTGGTT^ 

2341 + + + + + + 2400 

GACACAAAACAAACGCACTAGGTCAACCAAGAAAGCACGTGGGAACAAAGTAAGGGTACT 

ATTTG AATCCTAAAC C AAG ATGTG ATC C AC C CTTGC TTGT AATAC TGT AACT AATC ACTC 

2401 + + + + + + 2460 

TAAACTTAGGATTTGGTTCTACACTAGGTGGG 

GTCCAAACAGTX5ATCACGCAGGGGATAAACGCAGGAGACGTGAAGAAGCTGCAGGATXXTG 

2461 -r + + + + + + 2520 

CAGGTTTGTCACTAGTOCGTCCCCTATTTGCG 

GGGATCTACACTTGCAATGGCCTGATGATGCAG^ 

2521 + + + + + + 2580 

CCCTAGATGTGAACGTTACCGGACTACTACX3TGTGGT 

TAAATCTGCATGCCTTCTCCTTAAAATTTC 

2581 -f- + + + + + 2640 

ATTTAGACGTACGGAAGAGG AATTTT AAAC AC CAACGT ATG ACTTTC AACT AAAGACCAAG 

ATGACGCATGACTGCTATTAGTTCTTTTATG 
2641 + + + + + + 2700 

tact<x:gtactgacgataatcaacaaaatacataagctcaaagttgtgtactaactacaa 
gtttcaatatcaatgtatggattcatcttgcaga 

2701 + + + + + + 2760 

CAAAGTTATAGTTACATACCTAAGTAGAACGTCTCGGAATGTCCCTAATTCCCGAACAGA 

GAAGCAAAGGTTGATAAGATCTGCGAGGCTGCTGAGAAACTTC 

2761 + + + + + + 2820 

CTTCGTTTCCAACTATTCTAGACGCTtCGACGACTCTTTG 

TCTTTGCATTGATTXTTGATTTAGAACTTCTC . 

2821 + + + + ♦ + 2880 

AGAAACGTAACTAAGACTAAATCTTGAAGACACGGTATAATAAGAGAAACGTAACTAAGA 
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FIGURES (A) continued 

GGTTTACAACT7'CTGTGCCATCCGTGCTATGTGATAGAGTGTTCTAAAGTTTX3TATTCTA 

2881 + + , 

+ + h + 2940 

CCAAATG1TX5AAGACACGGTAGGCACGATACACTATCTCACAAGATTTCAAACATAAGAT 

294i r^ Arr ^ AGACT ^^ 

AAACTAAAAGTCTCAGTCCCAAAGTACTGTCCTTCACTAGAGGAATAATTCCATTCCA^ 30 °° 
AGGAGCTAAGCTACTGATGAAGGACGATCATACAGTTTAGTGCTC 

TC CTCG ATTCGATG ACT ACTTCCTGCT AG TATGTC AAATC ACG AC AAAAC CG TT ATC ACT 306 ° 

TACTATGCTAAACTTACTGTAAACAGTAGTTTGTTTTTGGA 
3061 h 1 H ^ ^ 3120 

ATGATACGATTTGAATGACATTTGTCATCAAACAAAAACCTTAGACGAACT^ 

31^1 TOAC ^ CACCTC ^ AC ^ CCOT ^ T ™ 

AATGAAGTGGACACGTGACGGGCCAACAAACTAAATGAAAAGACATATAACT^ 318 ° 

TCTGTTGTCCGGATTACCACTGGGAGCCAAGCACTTGATC 
3181 h 1 + H + + 3240 

AGACAACAGGCCTAAT<^TGACCCTCGGTTCGTC 

TGTCGTCCTTGATTCTGTTCTGATT 
3241 + + + + + + 33Qo 

ACAGCAGGAACTAAGACAAGACTAATAAGGACTACAATACGAGATTGGATAATTGTATAA 

TCCATAATTTGAAGGAGGGATTGAAACTCTCTGTATC 
3301 + + + + + + 336Q 

AGGTATTAAACT^CCTCCCTAACTTTGAGAGACATAGTGTCTCC 

GTCAGTAAATATTCCGGTTACCTTTTTCTTCTGGGCTTCT^ 
3361 + + + + + + 342Q 

CAGTCATTTATAAC^CAATGGAAAAAGAAGACCCGAAGAATAAAAAAACGTCGCTC 

CAATATATTAGTTGTATATCTGATATGTATGATTTGT^ 
3421 + + + + + + 34g0 

GTTATATAATCAACATATAGACTATACATACTAAACACX^TATAAACCGTCATCT 

TTAGTGCTAGTAATTTTTTAGTGCGTGTT^ 
3481 + + + + + + 354Q 

AATC ACG ATC ATTAAAAAATC AC GGACAACC ATTTTG AG A^ 

TTTTGTACTGTCTAGTCTTCGAGAGGTATTGATG 

3541 + + + , ^ 

+ + + + 3600 

AAAACATGACAGATCAGAAGGTCTCCATAACTACCATCACGTATA 

GTTATTGCTGTGATGCTAACGCTGAATACTGGATACCTC 
3601 + + + + + + 3660 

CAATAACGACACTACGATTGCGACTTATGACCTATGGACAATC 

TATGTACATCAATATTATTATTCACTATTTCCCTGAGGCAGCACAC 
3661 + + + + ^ + 3720 

ATACATGTAGTTATAATAATAAGTGATAAAGGGACTCCX3TCGTGTC 
3? 2 ' m ^ cc ATGAGCACAATTGTAACACTGAC^ 

AAAAAGGTACTCGTGTTAACATTGTGACTGA 3780 
^ o , ^^^ CAG ^CAACTTTCTAGC^ 

3 /Rl h + ^ ^ 3ft40 

GACTTAAGTCAAGGTTGAAAGATCGCAATTTTAGGTATAAT^ 



WO 98/28431 



12/18 



PCT/GB97/03546 



FIGURES (A) continued 



CAGGGAAGACCCAGTTGCXrTCATACTCTTTGTC 

3841 + + + + + + 3900 

GTCCCTTCTGGGTCAACCGAGTATGAGAAACACAGAGGTGAGTCCAGGTAAAGGACGGAA 

GTATTTCATCGATGTAACCTCACTACAACAGAATCCATGA 

3901 + + + + + + 3960 

CATAAAGTAGCTACATTGGAGTGATGTTGTCTTAGGTAC 

AAATGTATAACAGCTTCCACTCCACATGCATGGTGGGAACGGGAAGGTT^ 

3961 + + + + + + 4020 

TTTACATATTGTCGAAGGTGAGGTGTACGTACCACCCTTGCCCTTCCAACGGATGTAACT 

C ACTG AGGG AAC ATTGT ATC CTTTGG ATTC TTC AA 

4021 + + + + + + 4080 

GTG ACTCC CTTGT AAC AT AGG AAAC C T AAGG AATCATT ATGG AT ATCG ACT AAAC AAG TT 

TGAATTCATTTACATTTCTATATTTCTCGAAACrrc 

4081 + + + + + + 4140 

ACTTAAGTAAATGTAAAGATATAAAGAGCTTTGACAAAG^ 

CCTGAACGCATTGTGCCAATTGCTGAGAGATTTGGGATGGATGCCAA 

4141 + + + + + + 4200 

GGACTTGCGTAACACGGTTAACGACTCTCTAAACCCTACCTACGGTTACGACAAGAACTG 

AATGTATGGCTC CT ATTACATCTCTCTTG AC CC ATTTAAGGAAG AACGG ATC AAC ATCTT 

4201 + + + + + + 4260 

TTACATACCGAGGATAATGTAGAGAGAACTGGGTAAATTCCTTC 

TGTTTAATTCGTGATCTTTCTGTTTTAGATCATATA 

4261 + + + + + + 4320 

ACAAATTAAGCACTAGAAAGACAAAATCTAGTATATGCGAGCGCGTATGTGGATACTCGT 

CCAGTACAACTTACTCCTGGGCCTTCX^ 

4321 + + + + + + 4380 

GGTCATGTTGAATGAGGACCCGGAACGACGGTTCTACCGAC 

GGTACGCATGACTTTGCTGCCATGTAAATTTAGA^ 

4381 + + + + + + 4440 

CCATGCGTACTGAAACGACGGTACATTTAAATGTTAACTATC 

TGATCTTTGTTTGACTTGGAAATGATAGATTG 

4441 + + + + + + 4500 

ACTAGAAACAAACTGAACCTTTACTATCTAACACCTAAGAGACT 

TGATTTCAGTGGTAGGGGTGAACTTGCAGAGCGTC^ 

4501 + + + + + + 4560 

ACTAAAGTCACCATCCCCACTTGAACGTCTCGCAGTCCATAA 

TACGTGAAAAAATCAAGCAACTCAGGAAGTAGTCGAAGGCTTGCATTT^ 

4561 + + + + + + 4620 

ATGCACTTTTTTAGTTCGTTGAGTCCTTC 

TAAGTGATGTGTTCTGTGACTGCAGCAAAAACTGGC 

4621 + + + + + + 4680 

ATTCACTACACAAGACACTGACGTCGTTTTTGAC 

GATTGCTGAGGAGTTCAATGTTGCAGTGTACATCACC 

4681 + + + + + + 4740 

CTAACGACTCCTCAAGTTACAACGTCACATGTAGTGGTTGGTTC 

AATCCTATCTTTTCCAAGAAAGAGCT 

4741 + + 4766 

TTAGGATAGAAAAGGTTCTTTCTCGA 
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FIGURES (B) 



AAATTGTGAAGCACTTTCTATAAAAATATTTAAACATAGATG 
1 + + + + + + 6() 

TTTAACACTTCGTGAAAGATATTTTTATAAATTTGTATCT 

GTATCATATAATATTTATATGCGATGATGATTAGGCGGACCTCTCTACGTGAGGCTAGAA 
61 + + + + + + 

CATAGTATATTATAAATATACGCTACTACTAATCCGCCTGGAGAGATGCACTCCGATCTT 



ATTTGAATATTTTACTTACCTTGTTTTGTTTTO 

^ I H + H + 

TAAACTTATAAAATGAATGGAACAAAACAAAAAGGAAAGAT^ 

AAGTGGTTTGGATAGATAAAAAATAATTTTGGACCTAAT^ 

+ + + + + ^ 

TTCACCAAACCTATCTATTTTTTATTAAAACCTGGATTAAAATTTTAA^ 

GTAATAAGATTCTAAATGTTAAAACCAATAACGGATATCTTAGATCCACCTTTT^ 

f- i h -I f- 1- 

CATTATTCTAAGATTTACAATTTTGGTTATT^ 



TAGTGCACCCATCATCTCAAAATTTTGGATACACCT 

! h h ^ j h 

ATCACGTGGGTAGTAGAGTTTTAAAACCTATGTGGAGACGAAGATCTAATGAAGACTAGG 

TTTATATGCTTTATGCTGAATCATGATATCA 

+ H + + + + 

AAATATACGAAATACGACTTAGTACTATAGTAAGCCTCAAATTGTTCTAACGGTTAACTA 



360 



420 



TTGTCTGATTTACTGCAGCTTCCGACTAGTATGAAA 
421 + + + + + + 480 

AACAGACTAAATGACGTCGAAGGCTGATCATACTTTCC 

ATTGATACTGAGGGAACATTGTTTCCTTGCrAATATT^ 
461 + + + + + + 540 

TAACTATGACTCCCTTGTAACAAAGGAACGATTATAA^ 



541 



TAGCACCTATTACTCTCTTCATTAAGTAGCAGCTGCAGAAAC^ 

"f 1- + + ^ + 

ATCGTGGATAATGAGAGLAAGTAATTCATCGTCGLA 

CTTCCATTTTATCTTTTTTCCTCAACCAAGCGTACT 

h i h 1 4 i. 

GAAGGTAAAATAGAAAAAAGGAGTTGGTTCGCATGATGTCAGC^ 

CCATTGCTGAAAGATTTGGAATGGACGCTGGA 

+ I + 1 4 + 

GGTAACGACITTCTAAACCTTACCTGCGACCTCGTCAA 



600 



721 



TACACCCACATTTAATCATCTACTGCTCTTTGTTTAGTGTACTGATTT 

+ + + -+- + + 

ATGTGGGTGTAAATTAGTAGATGACGAGAAACAAATCACATGACTAAAGAATAGGAAAGA 



780 



TTCCTTATTATGTGATCAGATCATTTATGCTCGC 
781 + + + + + + 840 

AAGGAATAATACACTAGTCTAGTAAATACGAGCGCGTATGTGTATACTTGTAGTTATATT 



CC TGCTTCTTGGTCTGGC AGC AAAAATGGCTG AAG AGCC TTTC AG ACTTCTGGTG AAAGC 

H I H 4- * 4 h 

GGACGAAGAACCAGACCGTCGTTTTTACCGACTTCTCGGAAAGTCTGAAGA 
CACATCATCTGCTTTATCTTGAATAAGACCATTACTGGCGGCAGT^ 

+ 4 _ + 1 + 

GTGTAGTAGACGAAATAGAACTTATTCTGGTAATGACCGCCGTCAACAGAGTCTATGACT 
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FIGURES (B) continued 

AATTTTACTTGCAGATTGTTGACTCTGTGATTGCTTTA 

961 + + + + + + 1020 

TTAAAATGAACGTCTAACAACTGAGACACTAACGAAATAAAGCTCACCTAAA^ 

GAGGAGAGCTTGCAGACCGTCAGGTATAACTAAATACACAAGCATAATATTTGATTAATT 

1021 + + + + + + 1080 

CTCCTCTCGAACGTCTGGCAGTCCATATTGATTTATC 

AAAAACCTATCTCTGATATTTATCTGTGTTGAGAAGAACCTGCAATC 

1081 + + + + + + H40 

TTTTTGGATAGAGACTATAAATAGACACAACTCTTCTT^ 

GACTTTTTCTGAATGCTTACGCCTTCTTGC^ 

1141 + + + + + + 1200 

CTGAAAAAGACTTACGAATGCGGAAGAACGGTAAAGTCGTTTTCAACCG 

CACGATTGATAAAGATAGCTGAGGAATTTAATGTTGCTGTTTACATC 

1201, + + + + + + 1260 

GTGCTAACTATTTCTATCGACTCCTTAA 

TACAATCCAACCTTGCGTTTTACAAAATGTATTTGTTTAGATTTAA 

1261 + + + + + + 1320 

ATGn?^ CTITCG AACGC^ AAATGT': TTACATAAACAAATCTAAATTGCGTCTTTCTTGTT 

GTGTATAGATACACTGCTACTTCCTAAGTGTCGATGTAAATGGAT 

1321 + + + + + + 1380 

CACATATCTATGTGACGATGAAGGATTCACAGCTACATTTAC 

CCTATGTAATACTCTTTGCATGGCACTTTGCAAACAGCTA 

1381 + + + + + + 1440 

GGATACATTATGAGAAACGTACCGTGAAACGTTTGTCGATTTGTCTGAAAGAAAAATACA 

TACTGTATATTGATTGCCTTTCCTCGGACATTGAAAACTCTAC^ 

1441 + + + + + + 1500 

ATGACATATAACTAACGGAAAGGAGCCTGTAACTTTTGAGATGGGTAC 

TTCTCTTATTATTGAATGCAATTCCATCTATTCCCCTAATCCACG 

1501 + + + + + + 1560 

AAGAGAATAATAACTTACGTTAAGGTAGATAAGGGGATTAGGTGCCACCACACAAGTATA 

CAGATCCAAAGAAACCAGCAGGAGGCCATGTCCTTGC 

1561 + + + + + + 1620 

GTCTAGGTTTCTTTGGTCGTCCTCCGGTAC^ 

TGTTCAGGAAGGGCAAAGGAGAACAGCGTGTCTGCAA 

1621 4- + + + + 4- 1680 

ACAAGTCCTTCCCGTTTCCTCTTGTCGCACAGA 

CAGAGTCTGAAGCGATATCCTTTTTCTOATTATArTAGTTT^ 

1681 4- + + + 4- 4- 1740 

GTCTCAGACTTCGCTATAGGAAAAAGAATAATATAATCA 

GATTGTGATGTTTAGAGACAAGAAGAAAGCTCCACCCAATA 
1741 + + + + + -t- 1800 

CTAACACTACAAATCTCTGTTeTTCTTTC 

TCTTCTTTCCACTCAGTCGAAATTAGTCCAGTGTTAAACT 

1801 + + + + 4- + 1860 

AGAAGAAAGGTCACTCAGCTTTAATCACGTCACAATTTGATCTG^ 

TCCTTGACGAAAAACACATCTTCCAGATAACAGCAGGAGGTATTGCTGATGCT 

1861 + + + 4- + 4 1920 

AGGAACTGCTTTTTGTGTAGAAGGTCTATTGTCGTCCTCCAT ' 
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FIGURES (B) continued 

GAAAACCAGCTTGGACTTGCAACAGAGAGCTGCTTATGTTCTCT 
1921 + + + + + + 1980 

CTTTTGGTCGAACCTGAACGTTGTCTCTCGACGAAT^ 

GC TT AC ATTTTG AGG TT AGCT AT AAATAT ATT AATTTTGC AT AAA T AT AGTTTGT AAG AC 

1981 + + + + + + 2040 

CGAATGTAAAACTCCAATCGATATTTATATAATTAAAACGTATTTATATCAAACATTCTC 

AATGTTTTCTTACACTTATCTTGGTTGAGTGCTGTGTTATCGGTAATATAATT 

2041 + + + + + + 2100 

TT AC AAAAGAATGTGAAT AG AAC C AACTC ACG AC AC AAT AGCCATTAT ATTAATCC AAAG 

TTGCTCTTAGAGGATGACAATGTCAAGAGTCTTAGTTC 

2101 + + + + + + 2160 

AACGAGAATCTCCTACTGTTACAGTTCTCAGAATCAAGTG 

CACGAAATGTTGACATGCAATAGCAATAGCAGAAAAAAATGCTTATGTTGGCAG 

2161 + + + + + + 2220 

GTGCTTTACAACTGTACGTTATCGTTATCGTCTTTT^ 

TAGCATGCAATTTTTAGTCCTGATAGTTGATCCAATCATCTO 

2221 + + + + + + 2280 

ATCGTACGTTAAAAATCAGGACTATCAJ^CTAGGTTAGTAGATCAAGTGTTTAACGTTGGC 

TCTCCTAATACCTGTCCTACCAGCATATATGTGTAACATGCCAAGATTAGTAATAACCAT 

2281 + + + + + + 2340 

AGAGGATTATGGACAGGATGGTCGTATATACACATTGTACGGTTCTAATCATT 

TTGAAAACATGGGAAATTTGCAAGGGTAAAAGGAAAATGGAA 

2341 + + + + + + 2400 

AACTTTTGTACCCTTTAAACGTTCCCATTTTC 

AGATAAGCAAAGCCCCTGAATCGAAGACTGTTATTTAAAGTAAGTGTCTTCCCCAGCCAA 

2401 + + + + + + 2460 

TCTATTCGTTTCGGGGACTTAGCTTCTGACAATA 

GAAAATGGCTTAATTCAGTTTTAGAAACATAGAGATTCACATG^ 

2461 + + + + + + 2520 

CTTTTACCGAATTAAGTCAAAATCTTTGTATCTCTAAGTC 

CATATCTTCCCTCACTTTTTAACATTTCTC 

2521 + + + + + + 2580 

GTATAGAAGGGAGTGAAAAATTGTAAAGAG^TGTrTAGTATTTCGAGTAACG 

AGTT ATCAACTAGTTG AT AAC AG AAAGAGTTG 

2581 + + +— 2612 

TGAATAGTTGATCAACTATTGTCTTTCTCAAC 
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